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TITLE OF THE INVENTION 

MELANIN CONCENTRATING HORMONE RECEPTOR CHIMERIC AND 
FUSION PROTEINS 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims priority to provisional application U.S. 
Serial No. 60/189,698, filed March 15, 2000, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

10 The references cited herein are not admitted to be prior art to the 

claimed invention. 

Neuropeptides present in the hypothalamus play a major role in 
mediating the control of body weight. (Flier et al, 1998. Cell, 92, 437-440.) Melanin- 
concentrating hormone (MCH) is a cyclic 19-amino acid neuropeptide synthesized as 

1 5 part of a larger pre-prohormone precursor in the hypothalamus which also encodes 
neuropeptides NEI and NGE. (Nahon et al., 1990. Mol Endocrinol 4, 632-637.) 
MCH was first identified in salmon pituitary, and in fish MCH affects melanin 
aggregation thus affecting skin pigmentation. In trout and in eels MCH has also been 
shown to be involved in stress induced or CRF-stimulated ACTH release. (Kawauchi 

20 et al., 1983. Nature 305, 321-323.) 

In humans two genes encoding MCH have been identified that are 
expressed in the brain. (Breton et al., 1993. Mol Brain Res. 18, 297-310.) In 
mammals MCH has been localized primarily to neuronal cell bodies of the 
hypothalamus which are implicated in the control of food intake, including perikarya 

25 of the lateral hypothalamus and zona inertia. (Knigge et al., 1996. Peptides 17, 1063- 
1073.) 

Pharmacological and genetic evidence suggest that the primary mode 
of MCH action is to promote feeding (orexigenic). MCH mRNA is up regulated in 
fasted mice and rats and in the ob/ob mouse. (Qu et al, 1996. Nature 380, 243-247.) 
30 Injection of MCH centrally (ICV) stimulates food intake and MCH antagonizes the 
hypophagic effects seen with ot-melanocyte stimulating hormone (aMSH). (Qu et 
al., 1996. Nature 380, 243-247.) MCH-deficient mice are lean, hypophagic, and have 
increased metabolic rate. (Shimada et al, 1998. Nature 396, 670-673.) 

MCH action is not limited to modulation of food intake as effects on 
35 the hypothalamic-pituitary-axis have been reported. (Nahon 1994. Critical Rev. in 
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Neurobiol 8, 221-262.) MCH may be involved in the body response to stress as 
MCH can modulate the stress-induced release of CRF from the hypothalamus and 
ACTH from the pituitary. In addition, MCH neuronal systems may be involved in 
reproductive or maternal function. 
5 Several references describe a receptor that is indicated to bind MCH. 

(Chambers et al. 9 1999. Nature 400, 261-265; Saito et al 9 1999. Nature 400, 265-269; 
BSchner et aL, 1999. FEBS Letters 457:522-524; Shimomura et aL, 1999. 
Biochemical and Biophysical Research Communications 261, 622-626; and Lembo et 
aL, 1999. Nat. Cell Biol. 1, 267-271.) 

10 

SUMMARY OF THE INVENTION 

The present invention features melanin concentrating hormone receptor 

(MCH-R) chimeric and fusion proteins. MCH-R chimeric proteins comprise an 

MCH-R polypeptide region made up of at least two or more polypeptide regions 
15 characteristic of MCH-R found in different species. MCH-R fusion proteins comprise 

an MCH-R polypeptide region and a fluorescent protein region. 

An MCH-R polypeptide region provides a functional G-protein 

coupled receptor region able to bind MCH and transduce an intracellular signal. 

Examples of MCH-R polypeptide regions include naturally occurring MCH-R, 
20 chimeric MCH-R containing two or more regions from naturally occurring MCH-R, 

and functional derivatives thereof. 

Reference to the terms "characteristic" and "derivatives thereof 

describe a relationship to a reference sequence. In both cases, there is at least about 

75% sequence similarity to the reference sequence. 
25 Thus, a first aspect of the present invention describes a fusion protein 

comprising (a) an MCH-R polypeptide region and (b) a fluorescent polypeptide 

region. The fluorescent polypeptide region is joined directly, or though a polypeptide 

linker, to the carboxy side of the MCH-R polypeptide region. 

Another aspect of the present invention describes an MCH-R chimeric 
30 protein. The protein comprises: (a) an MCH-R binding region characteristic of a 

human MCH-R, (b) a transmembrane domain characteristic of a non-human MCH-R, 

and (c) an intracellular domain characteristic of a non-human MCH-R. 

Another aspect of the present invention describes a nucleic acid 

encoding for an MCH-R fusion protein or an MCH-R chimeric protein described 
35 herein. Such nucleic acid comprises either a contiguous nucleotide sequence that 



WO 01/68706 PCT/US01/08071 



codes for the protein or a sequence that is processed by a host cell to produce a 
contiguous nucleotide sequence encoding for the protein. Processing of a nucleic acid 
sequence to produce a contiguous nucleotide sequence encoding for a protein can 
occur by the splicing together of exons resulting in intron removal. 

5 Another aspect of the present invention describes an expression vector 

comprising a nucleic acid encoding for an MCH-R fusion protein or an MCH-R 
chimeric protein described herein. 

Another aspect of the present invention describes a recombinant cell 
comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R 

10 chimeric protein described herein. The nucleic acid may be part of the host genome 
or may exist independently of the host genome. 

Another aspect of the present invention describes a non-human 
transgenic animal comprising nucleic acid encoding for an MCH-R fusion protein or 
an MCH-R chimeric protein described herein. 

15 Another aspect of the present invention describes a method for 

assaying for MCH-R active compounds by measuring the effect of a test preparation 
on one or more MCH-R activities. The method is performed using either an MCH-R 
fusion protein or an MCH-R chimeric protein described herein. 

Other features and advantages of the present invention are apparent 

20 from the additional descriptions provided herein including the different examples. 
The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates aequorin assay results comparing a mouse MCH-R 
fusion with a human wild type MCH-R and a CMV-EGFP control. 

Figure 2 illustrates a cAMP flashplate assay of CHO cell clones stably 

30 expressing mMCH-lR-EGFP. Cells from individual clones were dissociated in 
enzyme free media and stimulated for 15 minutes at 37°C with human MCH at the 
indicated concentrations in the presence of 10 jiM forskolin. Cells were then lysed 
and assayed for bound [ 125 I]cAMP. Mouse MCH-1R-EGFP clones exhibited EC50 
values (0.1111, 0.1255, 0.1291, or 0.2304 nM) indistinguishable from that of a CHO 

35 cell clone expressing the wild-type human short isoform of MCH-1R ( 0.1282 nM). 
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Figure 3 illustrates a cAMP flashplate assay of CHO cell clones stably 
expressing human short/mouse species chimeric MCH-1R-EGFP. Cells from 
individual clones were dissociated in enzyme free media and stimulated for 15 
minutes at 37°C with human MCH at the indicated concentrations in the presence of 
5 lOuMforskolin. Cells were then lysed and assayed for bound [ 125 I]cAMP. Human 
short/mouse species chimeric MCH-1R-EGFP clones exhibited EC50 values (0.0366, 
0.0462, 0.21 17, or 0.2499 nM) indistinguishable from that of a CHO cell clone 
expressing the wild-type human short isoform of MCH-1R (0.1137 nM). 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention features MCH-R chimeric and fusion proteins. 
Such proteins have a variety of different uses including being used as a research tool 
to study MCH-R function and dynamics, and being used to screen for MCH-R 
agonists and antagonists. 

15 The MCH-R provides a target to achieve different beneficial effects in 

a patient. Preferably, MCH-R activity is modulated to achieve one or more of the 
following: weight loss, weight gain, treat cancer {e.g., colon or breast), reduce pain, 
treat diabetes, reduce stress, or teat sexual dysfunction. 

Modulation of MCH-R activity can be achieved by evoking a response 

20 at the MCH receptor or by altering a response evoked by an MCH receptor agonist or 
antagonist. Compounds modulating MCH-R receptor activity include agonists, 
antagonists, and allosteric modulators. Generally, MCH-R antagonists and allosteric 
modulators negatively affecting activity will be used to achieve weight loss, treat 
cancer {e.g., colon or breast), reduce pain, reduce stress, or teat sexual dysfunction; 

25 and MCH-R agonists and allosteric modulators positively affecting activity will be 
used to produce a weight gain. 

Preferably, MCH-R activity is modulated to achieve a weight loss or to 
treat diabetes in a patient. Diabetes mellitus can be treated by modulating MCH-R 
activity to achieve, for example, one or both of the following: enhancing glucose 

30 tolerance or decreasing insulin resistance. 

Excessive body weight is a contributing factor to different diseases, 
including hypertension, diabetes, dyslipidemias, cardiovascular disease, gall stones, 
osteoarthritis, and certain forms of cancers. Bringing about a weight loss can be used, 
for example, to reduce the likelihood of such diseases and as part of a treatment for 

35 such diseases. Weight reduction can be achieved by modulating MCH-R activity to 
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obtain, for example, one or more of the following effects: reducing appetite, 
increasing metabolic rate, reducing fat intake, or reducing carbohydrate craving. 

Increasing body weight is particularly useful for a patient having a 
disease or disorder, or under going a treatment, accompanied by weight loss. 
5 Examples of diseases or disorders accompanied by weight loss include anorexia, 
AIDS, wasting, cachexia, and frail elderly. Examples of treatments accompanied by 
weight loss include chemotherapy and radiation therapy. 

MCH-R Oiimeric Proteins 

10 MCH-R chimeric proteins contain an MCH-R polypeptide region made 

up by at least two or more polypeptide regions characteristic of MCH-R found in 
different species. The different polypeptide regions that are present provide for an N- 
terminal extracellular domain; a transmembrane domain made up of transmembrane 
regions, extracellular loop regions, and intracellular loop regions; and an intracellular 

15 carboxy terminus domain. Examples of MCH-R amino acid sequences include the 
following: SEQ. ID. NO. 1 (human MCH1R long form), SEQ. ID. NO. 2 (human 
MCH1R short form), and SEQ. ID. NO. 3 (mouse MCH1R). 

Preferably, the MCH-R chimeric protein comprises an MCH-R binding 
region characteristic of a human MCH-R along with transmembrane and intracellular 

20 domains characteristic of a non-human MCH-R. There are substantial amino acid 

differences between the N-terminus of the MCH-R present in humans and that present 
in other species such as mice. Such differences could result in, for example, the 
mouse MCH-R having different intrinsic properties and responsiveness to agonists 
and/or antagonists than the human MCH-R. The presence of a human MCH-R 

25 binding region provides for a "humanized" MCH-R chimeric receptor. 

The transmembrane and intracellular domains characteristic of a non- 
human MCH-R can be used in conjunction with a non-human host to provide a more 
naturally occurring environment for these regions. For example, an MCH-R chimeric 
having mouse transmembrane and intracellular domains are preferably used in murine 

30 cells lines or in transgenic mice. 

MCH-R chimeric proteins may contain regions other than extracellular, 
transmembrane, and intracellular domains that do not substantially decrease the 
activity of the protein. Preferably, additional regions do not cause a decrease of more 
than about 25% of MCH-R activity as measured using one or more of the assays 
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described in the examples provided below. Examples of additional regions that may 
be present include fluorescent protein regions and linker regions. 

In an embodiment of the present invention, the MCH-R chimeric 
protein comprises: (a) an MCH binding region characteristic of a first species and (b) 

5 a transmembrane and intracellular domain region characteristic of a second species 
joined directly, or though a linker, to the carboxy side of the MCH binding region. 
Preferably, the protein comprises, consists, or consists essentially of an MCH-R 
polypeptide having a sequence similarity of at least about 75%, at least 85%, or at 
least 95% with either SEQ. ID. NO. 4 (human short form/mouse species chimeric 

10 MCH1R) or SEQ. ID. NO. 5 (human long form/mouse species chimeric). Even more 
preferably, the protein comprises, consists essentially of, or consists of, SEQ. ID. NO. 
4orSEQ.ID.N0.5. 

Sequence similarity for polypeptides can be determined by BLAST. 
(Altschul et at, 1997. Nucleic Acids Res. 25, 3389-3402, hereby incorporated by 

15 reference herein.) In an embodiment of the present invention, sequence similarity is 
determined using tBLASTn search program with the following parameters: 
MATRIX:BLOSUM62, PER RESIDUE GAP COST: 11, and Lambda ratio: l/ 

Differences in naturally occurring amino acids are due to different R 
groups. An R group effects different properties of the amino acid such as physical 

20 size, charge, and hydrophobicity. Amino acids can be divided into different groups as 
follows: neutral and hydrophobic (alanine, valine, leucine, isoleucine, proline, 
tryptophan, phenylalanine, and methionine); neutral and polar (glycine, serine, 
threonine, tyrosine, cysteine, asparagine, and glutamine); basic (lysine, arginine, and 
histidine); and acidic (aspartic acid and glutamic acid). 

25 Generally, in substituting different amino acids it is preferable to 

exchange amino acids having similar properties. Substituting different amino acids 
within a particular group, such as substituting valine for leucine, arginine for lysine, 
and asparagine for glutamine are good candidates for not causing a change in 
polypeptide functioning. 

30 Changes outside of different amino acids groups can also be made. 

Preferably, such changes are made taking into account the position of the amino acid 
to be substituted in the polypeptide. For example, arginine can substitute more freely 
for nonpolor amino acids in the interior of a polypeptide then glutamate because of its 
long aliphatic side chain. (See, Ausubel, Current Protocols in Molecular Biology, 

35 John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 
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MCH-R Fusion Proteins 

MCH-R fusion proteins contain an MCH-R polypeptide region and a 
fluorescent protein region either directly joined together or joined together through a 
5 linker. These regions provide MCH-R activity and a marker for evaluating MCH-R 
dynamics. 

An MCH-R polypeptide region provides functional MCH-R activity 
and includes naturally occurring MCH-R, chimeric MCH-R, and derivatives thereof. 
Preferred derivatives thereof have a sequence similarity of at least about 75%, at least 

10 about 85%, or at least about 95% to a naturally occurring MCH-R or a chimeric 
MCH-R described herein. 

A fluorescent protein region contains a chromophobe that fluoresces. 
Preferably, the fluorescent protein region is the green fluorescent protein of the 
jellyfish Aequorea victoria or a derivative thereof. Preferred derivatives have a 

15 sequence similarity of at least about 75%, at least about 85%, or at least about 95% to 
the Aequorea victoria green fluorescent protein (GFP). The Aequorea victoria green 
fluorescent protein and examples of derivatives thereof are described by Cormack et 
aL, 1996. Gene 17, 33-38; Yang etal, 1996. Nucleic Acids Research 24, 4592-4593; 
Tsien et aL, U.S. Patent No. 5,625,048; Tsien et aL, U.S. Patent No. 5,777,079; and 

20 Cormack et aL, U.S. Patent No. 5,804,387 (each of which are hereby incorporated by 
reference herein). 

In different embodiments the MCH-R polypeptide region comprises, 
consists essentially of, or consists of, a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 

25 NO. 5; and the fluorescent polypeptide region comprises, consists essentially of, or 
consists of, an amino acid sequence selected from the group consisting of SEQ. ID. 
NO. 6 (GFP), SEQ. ID. NO. 7 (EGFP), SEQ. ID. NO. 8 (Emerald), SEQ. ID. NO. 9 
(Topaz), and SEQ. ID. NO. 10 (Wlb). EGFP, Emerald, Topaz, and Wlb are 
derivatives of GFP. 

30 The optionally present linker is a polypeptide region that is preferably 

from 1 to about 100 amino acids in length. In different embodiments the linker is up 
to 75, 50 or 25 amino acids in length. 

Preferably, the MCH-R fusion protein comprises, consists essentially 
of, or consists of, the MCH-R polypeptide region and the fluorescent polypeptide 

35 region. More preferably, the protein comprises, consists essentially of, or consists of, 
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an amino acid sequence selected from the group consisting of: SEQ. ID. NO. 11 
(mouse MCHlR-linker-EGFP), SEQ. ID. NO. 12 (mouse MCH1R/EGFP direct 
fusion), SEQ. ID. NO. 13 (human short form/mouse species chimeric MCHlR-linker- 
EGFP), or SEQ. ID. NO. 14 (human long form/mouse species chimeric MCH1R- 
5 linker-EGFP). 

MCH-R Chimeric and Fusion Proteins Nucleic Acid and Expression 



techniques well known in the art. Preferably, such proteins are produced by 
10 recombinant expression inside a host cell by way of an expression vector or by way of 

nucleic acid integrated into the host genome. Examples of nucleic acid sequences 

encoding for MCH-R polypeptide regions, fluorescent protein regions, MCH-R 

chimeric proteins, and MCH-R fusion proteins are provided for by SEQ. ID. NOs. 15- 

29 (see Example 1, infra). 
15 Starting with a particular amino acid sequence and the known 

degeneracy of the genetic code, a large number of different encoding nucleic acid 

sequences can be obtained. The degeneracy of the genetic code arises because almost 

all amino acids are encoded for by different combinations of nucleotide triplets or 

codons. The translation of a particular codon into a particular amino acid is well 
20 known in the art {see, e.g., Lewin GENES IV, p. 1 19, Oxford University Press, 1990). 

Amino acids are encoded for by codons as follows: 

A=Ala=Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 

D=Asp=Aspartic acid: codons GAC, GAU 
25 E=Glu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His=Histidine: codons CAC, CAU 

I=Ile=Isoleucine: codons AUA, AUC, AUU 
30 K=Lys=Lysine: codons AAA, AAG 

D=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asn=Asparagine: codons AAC, AAU 

P=Pro=Proline: codons CCA, CCC, CCG, CCU 
35 Q=Gln=Glutamine: codons CAA, CAG 



MCH-R chimeric and fusion proteins can be produced using 
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R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 

S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thr=Threonine: codons ACA, ACC, ACG, ACU 

V=Val=Valine: codons GUA, GUC, GUG, GUU 
5 W=Trp=Tryptophan: codon UGG 

Y=Tyr=Tyrosine: codons UAC, UAU 

Examples of techniques for introducing nucleic acid into a cell and 

expressing the nucleic acid to produce protein are provided in references such as 

Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987-1998, and 
10 Sambrook, et aL , in Molecular Cloning, A Laboratory Manual, 2 nd Edition, Cold 

Spring Harbor Laboratory Press, 1989. 

An expression vector contains recombinant nucleic acid encoding for a 

polypeptide along with regulatory elements for proper transcription and processing. 

The recombinant nucleic acid contains two or more nucleic acid regions not naturally 
15 associated with each other. Exogenous regulatory elements such as an exogenous 

promoter can be useful for expressing recombinant nucleic acid in a particular host. 

Examples of expression vectors are cloning vectors, modified cloning vectors, 

specifically designed plasmids, and viruses. 

Generally, the regulatory elements that are present in an expression 
20 vector include a transcriptional promoter, a ribosome binding site, a terminator, and 

an optionally present operator. Another preferred element is a polyadenylation signal 

providing for processing in eukaryotic cells. Preferably, an expression vector also 

contains an origin of replication for autonomous replication in a host cell, a selectable 

marker, a limited number of useful restriction enzyme sites, and a potential for high 
25 copy number. 

Expression vectors providing suitable levels of polypeptide expression 
in different hosts are well known in the art. Mammalian expression vectors well 
known in the art include pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 
(Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-l(8-2) 
30 (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), pCI- 
neo (Promega) and .lambda.ZD35 (ATCC 37565). Bacterial expression vectors well 
known in the art include pETlla (Novagen), lambda gtll (Invitrogen), pcDNAII 
(Invitrogen), and pKK223-3 (Pharmacia). Fungal cell expression vectors well known 



-9- 



WO 01/68706 PC17US01/08071 



in the art include pYES2 (Invitrogen) and Pichia expression vector (Invitrogen). 
Insect cell expression vectors well known in the art include Blue Bac IE (Invitrogen). 

Recombinant host cells may be prokaryotic or eukaryotic. Examples 
of recombinant host cells include the following: bacteria such as E. coli; fungal cells 

5 such as yeast; mammalian cells such as human, bovine, porcine, monkey, hampster, 
and rodent; and insect cells such as Drosophila and silkwonn derived cell lines. 
Commercially available mammalian cell lines include L cells L-M(TK.sup.-) (ATCC 
CCL 1.3), Lcells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 
86), CV-1 (ATCC CCL70),.COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 

10 CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), 
HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and 
MRC-5 (ATCC CCL 171). 

To enhance expression in a particular host it may be useful to modify 
the sequence to take into account codon usage of the host. Codon usage of different 

15 organisms are well known in the art, (See, Ausubel, Current Protocols in Molecular 
Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 

Expression vectors may be introduced into host cells using standard 
techniques. Examples of such techniques include transformation, transfection, 
lipofection, protoplast fusion, and electroporation. 

20 Nucleic acid encoding for a polypeptide can be expressed in a cell 

without the use of an expression vector employing, for example, synthetic mRNA or 
native mRNA. Additionally, mRNA can be translated in various cell-free systems 
such as wheat germ extracts and reticulocyte extracts, as well as in cell based systems, 
such as frog oocytes. Introduction of mRNA into cell based systems can be achieved, 

25 for example, by microinjection. 

Techniques for producing transgenic animals are well known in the art. 
Examples of such techniques are provided for by Teratocarcinomas and embryonic 
stem cells: a practical approach. Ed. By E. J. Robertson, IRL Press limited, Oxford, 
England (1987); and Gene Targeting: a practical approach. Ed. By A. L. Joyner, 

30 Oxford University Press Inc. New York, NY (1993). 



G-Protein Coupled Receptor Assays 

MCH-R is G-protein coupled receptor. Techniques for measuring 
different G-protein activities, such as Gi/o, Gs, and Gq are well known in the art. 
35 MCH-R activity is preferably assayed for by measuring either Gi/o or Gq. 
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Gi/o and Gs activity can be measured using techniques such as a 
melonaphore assay, measuring cAMP production, measuring inhibition of cAMP 
accumulation, and measuring binding of 35S-GTP. cAMP can be measured using 
different techniques such as radioimmunoassay and indirectly by cAMP responsive 
5 gene reporter proteins. 

Gq activity can be measured using techniques such as those measuring 
intracellular Ca 2+ . Examples of techniques well known in the art that can be 
employed to measure Ca 2+ include the use of dyes such as Fura-2 and the use of Ca 2+ - 
bioluminescent sensitive reporter proteins such as aequorin. An example of a cell line 
10 employing aequorin to measure G-protein activity is HEK293/aeql7. (Button et 
al., 1993. Cell Calcium 14, 663-671 . and Feighner et al, 1999. Science 284, 2184- 
2188, both of which are hereby incorporated by reference herein.). 

Functional assays can be performed using individual compounds or 
preparations containing different compounds. A preparation containing different 
15 compounds where one or more compounds affect MCH-R chimeric or fusion protein 
activity can be divided into smaller groups of compounds to identify the compound(s) 
affecting MCH-R chimeric or fusion protein activity. In an embodiment of the 
present invention a test preparation containing at least 10 compounds is used in a 
functional assay, 

20 Functional assays can be performed using recombinant!/ produced 

MCH-R chimeric or fusion protein present in different environments. Such 
environments include, for example, cell extracts and purified cell extracts containing 
the MCH-R chimeric or fusion protein expressed from recombinant nucleic acid and 
an appropriate membrane for the polypeptide; and the use of a purified MCH-R 

25 chimeric or fusion protein produced by recombinant means that is introduced into a 
different environment suitable for measuring G-protein activity. 

Fluorescent Protein Assays 

Fluorescent protein joined to an MCH receptor can be employed to 
30 study different aspects of receptor dynamics including receptor sequestration, receptor 
densitization, and receptor localization. The fluorescent protein can be used in in 
vitro or in vivo systems. 

In vitro applications of fluorescent proteins can be performed using 
techniques well known in the art. Examples of such techniques are provided by Barak 
35 et al., 1997. Mol Pharm. 5, 177-184; Tarasova et al. , 1997. /. Biol Chem. 272, 
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14817-14824; Lin et al, 1998. Mol Cell Endo. 146, 27-37; Tarasova et al, 1998. J. 

Biol. Chem. 273, 15883-15886; KaM etal., 1998. /. Biol Chem. 273, 322-328; 

Groake et al, 1999. /. Biol Chem. 274, 23263-23269; Doherty et al, 1999. 

Biochem. J. 341, 415-422; Brock et al, 1999. Proc. Natl Acad Sci. USA 96, 10123- 
5 10128; Cornea et al, 1999. Endocrinology 140, 4272-4280; and Lembo et al., 1999. 

Nat. Cell Biol 1, 267-271 (these references are not admitted to be prior art to the 

claimed invention). 

In vivo applications of fluorescent proteins can be performed using 

techniques well known in the art. Examples of such techniques are provided by 
10 Mombaerts et al., 1996. Cell 87, 675-686; Rodriquez et al., 1999. Cell 97, 199-208; 

Spergel et al., 1999. J. Neurosci. 1, 2037-2050; and Zuo et al, 1999. Proc. Natl 

Acad. Sci. USA 96, 14100-14105 (these references are not admitted to be prior art to 

the claimed invention). 

15 EXAMPLES 

Examples are provided below to further illustrate different features and 
advantages of the present invention. The examples also illustrate useful methodology 
for practicing the invention. These examples do not limit the claimed invention. 

20 Example 1: 

Amino acid and nucleic acid sequence information for SEQ. ID. NOs. 
1-29 are provided below. SEQ. ID. NOs. 1-29 include examples of polypeptide and 
encoding nucleic acid sequences for MCH-R polypeptide regions, fluorescent 
polypeptide regions, fusion proteins and chimeric proteins. In some cases the 
25 encoding nucleic acid is shown with additional nucleic acid upsteam or downstream 
from an open reading frame. 

SEQ. ID. NO. 1: Human long form MCH1R 

MSVGAMKKGVGRAVGIXjGGSGCQATEED 
30 AWVEGSSARLWEQATGTGWMD^ 
TGSISYINIIMPSWGTICll^ 

YIATVHPISS1T0FRKPSVATLVICIX 
RIJNPDTDLYWFTLY^ 
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TTOmilTAIAICLVFFVCW^^ 
CUWVYIVLCETOIEIILVLSVCT^ 

SEQ. ID. NO. 2: Human short form MCH1R 

5 MDI^SIJJTCPNASOTSDGPDNLTSAGSPPRTGSISY 
NSTVIFAWKKSKIJEIWCNNWDIF^ 

FGETMCTIJTAMDANSQFTSTmTAMAIDRYIATVHPISSTKFRKPSVA^ 
GliWAI^HSriPVwXYARIJPFPGGAVGCGIRIPNPDro 

IPFVWAAYVRII^RMTSS VAPASQRSmmTKRVTRTAIAICLVFFVCW APY 
10 YVlJ^LTQIJslISRPTLTFVYLYNAA^ 

KPAAQGQLRAVSNAQTADEERTESKGT 
SEQ. ID. NO. 3: Mouse MCH1R 

MDIXJASLLSTCPNASNKDGQDNFIIAGPPPRTOSVSYINID^S 
15 VGNSTVIFAVVKKSKLHWCSNWDIFIW^ 

WHFGETMCTLTTAMD ANS QFTSTYD^TAMAIDR YIATVHPIS STKFREPSMAT 
LVICIXWAI^HSITPVWLYARIJPFPGGAVGC^ 

FAIJFVWAAYVKE.QRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA 
PYYVIX3LTQLSISRPTLTFVYLYNAAISLGYANSCXNPFVYIVIx:Er^ 
20 VKPAAQGQLRTVSNAQTADEERTESKGT 

SEQ. ID. NO. 4: Human short form/mouse species chimeric MCH1R 

MDLEASLLrTGPNASNTSDGPD^TSAGSPPRTGSISYINIINffSVFGTIClXGn 
NSTWAVVKKSKIJIWCNNWDIFIINI^ 

25 FGF/TMCTLITAMDANSQFrSTYILTAMAIDRYlATVHPK 

CIXWAI^FISIIPVWXYARLffFPCKJAVGCGIRIJNPDTDLYWFIL 
LPFVVITAA YVKE.QRMTS S V APAS QRSIRIJ^TKR VTRT AIAICLVFFVC W APY 
YVLQLTQI^ISRPTLTFVYLYNAAIsmYANSCIJ^^ 
KPAAQGQLRTVSNAQTADEERTESKGT 

30 

SEQ. ID. NO. 5: Human long form/mouse species chimeric MCH1R 

MS VG AMKKGVGRAVGLGGGS GCQ ATEEDPLPNCGAC APGQGGRR WRLPQP 

AWVEGSSARLWEQATGTGWMDLEASIXPTGPNASNTSDGPDNLTSAGSPPR 
TGSISYINIIMPSWGTICIIXfflGNSTWAV^^ 

35 IIJ'LLGMPFMffiQmGNGVWHFGEmcnJTAMD 
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YLATVHPIS STKFRKPSMATLVICIXWAl^HSITPVWLYARLIPFPGGAVGCGI 
RLPNPDTDLYWFTLYQFFLAFAIJFV^^ 

TKRVTRTAIMCLVFFVCWAPYYVIX2LTQl^ISRFrLTFVYLYNAM 
(XNPFVYIVaXETFRKI^VI^VKPAAQGQLRTVSNAQTADEERTESKGT 

5 

SEQ. ID. NO. 6: GFP 

MSKGEEIJFTGVVPILVEIJ)GDWGHKTSVSGEGEGDATYG 
PWWPTLVTTFSYGVQCFSRYPDHMK 
YKTRAEVKFEGDTLVNRIEIJCGroFKEDGNimHKUE 
10 KNGKVNFKIRHNIEDGSVQLADHYQQNTPI^ 
PNEJO^HMVLLEFVTAAGITHGMDELYK 

SEQ.ID.N0.7:EGFP 

MVSKGEEIJTGVWILVEIJ)GDVNGHKFSVSGEGEGDATYGKLTLKnCTTGK 
15 IP WWPTLVTTLTYGVQCFSRYPDHMKQHDFFKS AMPEGYVQERTBFFKDDG 
NYKTRAEVKFEGDTLVNRIELJ£^ 
QKNGIKVNFKlRHNlEDGSVQIADHYQQ 
DPNEKRDHMVII^FVTAAGITLGMDELYK 

20 SEQ. ID. NO. 8: Emerald 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Tip Pro Thr 
Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

25 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr He Thr Ala Asp Lys Gin Lys 
Asn Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser Val Gin Leu 

30 Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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SEQ. ID.NO. 9: Topaz 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
5 Leu Val Thr Thr Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Arg 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr De Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly De Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
IDs Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys 
10 Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn De Glu Asp Gly Ser Val Gin Leu 
Ala Asp His Tyr Gin Gin Asn Thr Pro De Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

15 SEQ. ID. NO. 10: W1B 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

20 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly De Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
His Lys Leu Glu Tyr Asn Tyr lie Ser His Asn Val Tyr lie Thr Ala Asp Lys Gin Lys 
Asn Gly lie Lys Ala His Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

25 Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
IBs Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

SEQ. ED. NO. 11: Mouse MCHlR-linker-EGFP 

30 MDI^ASII^TGPNASNISDGQDN^ 
VGNSTVIFAVVKKSKIJErWCS 
WHFGETMOTJTAMDANSQ 

LViaiWAi^nsnpvwLYA 

FAIPFVVnAAYVHLQRMT^ 
35 PYYVI^LTQIJSISRPTLTFV^ 
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VKPAAQGQIJRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGV 
WILVELDGDVNGHKFSVSGEGEGDATYGKLTLKHCTTGKIPWWPTL 
TYGVQCFSRYPDHMKQHDFFKSAMPEGYVQmTM^DGNYKTRAEV^ 
GDTLVNRIELKGIDFKEDGNILGHKI^Y]^^ 
5 HMEDGSVQLADHYQQNTPIGDGPVIJJPDNHY^ 
LEFVTAAGITIjGMDELYK 

SEQ. ID. NO. 12: Mouse MCH1R/EGFP direct fusion 

MDLQASIJLSTGPNASNISDCKpiNn^^ 
10 VGNSTVIFAWKKSKIJIWCSNWDIFIINLSV^ 

WHFGETMCTLTTAMDANSQFTSTYILTAMAIDRYIATVHPIS^ 

LVKXLWALSFISnPVWLYARIJPFPGG^^ 

FAIJPFVWAAYVKnj^RMTSSVAPASQRSIRmTKRVTRT 

PYYVIX)LTQI^ISRPTLTFVYLYNAAIS 
15 VKPAAQGQI^TVSNAQTADEHRTESKGTMVSKGEELJ^GVWILVELDGDW 

GHKFS VS GEGEGD AT YGKLTLKHCTTGKLJ 5 W WPTLVTTLTYG VQCT 

HMKQHDFFKSAMPEGYVQERTIFTTCDDGN^ 

DFKEDGNILGHKLJ3YNYNSHNVYIMAD^ 

HYQQNTmGDGPVIXPDNHYI^TQSAISKDPNEKRDH^^ 

20 MDELYK 

SEQ. ID. NO. 13: Human short form/mouse species chimeric MCHlR-linker- 
EGFP 

MDIJEASIlPTGPNASNTSDGPDNLTSAGSPPRTGSISYEmiPSWGTICLLGnG 

25 NSTVIFAVVKKSKLHWCNNWDn'IINIilVVDL^ 

FGETMCTIJrrAMDANSQFTSTYlLTAMAIDRYLATVHPISSTKF 
(XLWAIiiHSIIPVwXYARIIPPRGGAVGCGI^^ 
LEFVVTTAAYVKILQRMTSSVAPASQRSIRl^ 
YVI^LTQI^ISRPTLTFVYLYNAAISLGYANSC^^ 

30 KPAAQGQmTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGVV 
PILVELDGDVNGHKFSVSGEGEGDATYGKLTIiG?ICTC 
YGVQCTSRYPDHMKQHDFEKSAMPEGYVQERTIFFKDDGNYKTR^ 
DTLVNRlELKG]DFKEDGNttGHKLE YNYNS HNVYIMADKQKNGIKrVNFKIRH 
NIED GS VQLADHYQQOTRIGDGP VI1JDNHYLSTQS ALSKDPNEKRDHMVLL 

35 EFVTAAGITLGMDELYK 
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SEQ. DO. NO. 14: Human long form/mouse species chimeric MCHlR-linker- 
EGFP 

MSVGAMKKGVGRAVGIJ3GGSOCQATEEDPLPNCGACAPGQGGRRWRLPQP 
5 AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR 
TGSISYIISnmVIPSWGTICUJjnGNSTW^ 
IXHIXjMPFMIHQIMGNGVWHFGETMCTLJTAMDANSQF^ 
YIATVHPISSTKFRKPSMATLVICLLWAI^ 
RIJNPDTDLYWFIl.YQFFI^ALPFVWAAYVKILQRMT 
10 TKRVTRTAMICLVFFVCWAPYYVIX^LT^^ 

CIJSPFVYTVIX^TFRKRLVI^VKPAAQGQIJITVSN^ 

AGPGSIATMVSKGEEIJTCVWILVELDGDVNGHKFSVSGEGEGDATYGKLT 
LKHCTTGKIPWWPTLVTTLTYGVQCTSRYPDH3VIK 
RTIFFKDDGNYKTRAEVKFEGDTLVNRIELJKGm 
15 NVYIMADKQKTMGIKWFKIRH^ 

I^TQSAI^KDPNEBQlDHMVLLEFVTAAGniGMDELYK 

SEQ. B>. NO. 15: Human long form MCH1R cDNA 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
20 GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACC1GACTTCGGCAGGATCACCT 
25 CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 
GGCACCATCTGCCTCCTGG^JCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 
CATCATCAACCTCrCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 

catgatccaccagctcatg<3gcaatg<3ggtgtg<k3actttggggagacca 
30 tgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc 
tacatcctgaccgccatggcgattgaccgctacctggccactgtccacccc 
atctctrccacgaagttccggaagccctctgtggccaccctggtgatctgc 
ctcctgtgggccctctccttcatcagcatcacccctgtgtggctgtatgcc 
agactcatccccttcccaggaggtgcagtgggctgcg<k:ataccm:ctgcc 
35 caacccagacactgacctctactggttcaccctgtaccagtttttcctggc 
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CTITGCCCTGCCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCA 
GCGCATGACGTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGC 
GGACAAAGAGGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTT 
GTGTGCTGCK3CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGC 
5 CGCCCGACCCTCACCTrTGTCTACTTATACAATGCGGCCATCAGCTTGGGC 
TATGCCAACAGCrGCCTCAACCCCTITGTGTACATCGTGCTCTGTGAGACG 
TTCCGCAAACGCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCT 
TCGCGCTGTCAGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGC 
AAAGGCACCTGA 

10 

SEQ. ID. NO. 16: Human short form MCH1R cDNA 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 

15 GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCrCCTCTTTCTCCTCH3GCATGCCCTTCATGATCCA 
CCAGKJTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 

20 ACCGGCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGCCTCCTGTGG 
GCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCCAGACTCATC 
CCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCCCAACCCAGA 
CACTGACCTCTACTGGTTCACCCTGTACCAGTTrTTCCTGGCCTlTGCCCTG 

25 CCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCAGCGCATGAC 
GTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGCGGACAAAGA 
GGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 
CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGCCGCCCGACCC 
TCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGCTATGCCAACA 

30 GCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACGTTCCGCAAAC 
GCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCTTCGCGCTGTC 
AGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCT 
GA 
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SEQ. ID. NO. 17: Mouse MCH1R cDNA 

Nucleic acid sequence start and stop codons are highlighted: 

GGCGGTAGAGGAAGACCCTITICTGGACTGCGGGGCTCAAGCTCCGGACA 
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 
5 CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTrcACATTGGCGGGGCCACCTCCTCGCACAAGGAGTGTCTCCT 
ACATCAACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCA 
TTGTGGGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTG 
10 CACTGGTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTG 
GATCTGCTITrccnX3CTGGGCATGCCTI^ 

AATGGTGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCAT 
GGACGCCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCA 
TTGACCGCTACTrGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGA 

15 AGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCA 
TTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGG 
GTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTITACT 
GGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCAT 
CACTGCTGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCC 

20 CAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACA 
GCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTG 
CTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTAC 
CTGTACAATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCC 
CITrGTGTACATAGTACTCTGTGAGACCTITCGAAAACGKnTGGTGCTGTC 

25 GGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGA 
CAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCTGACAATCCCCCCC 
GGTCACCTCCAAGTCAGGTCACCGCATCAAACCATGGGGAGAGATACTGA 
GATAAACCCGGGGCTACCCTGGGAGGATGCAGAAGCTGGAGGCTGGGGG 
CTTGTAGCAAACCACATTCCACGGGGCCCACAAATTGCTAGGGAGGCTTG 

30 CAGCCTGGTTTGGGGGGGAAGCCTCAGACTGCAGGGATCCCCTTGACAGA 
ATAGAAGCGGAGCAAGAAGGAAAGGGTGGTTTGACTGGTTCTCGGGGTCT 
GTATCTGTTGGCTCGCATATATCTTTCTCTCAAGGGAAGAAGGCGGAGGT 
CK^CTAGCTCHjGTTCCTTTAAAACTAGGCAGGGCTAGGATCTGAGCAGCTA 
GGGCTCTACTGTGAGACTGGGCAAGCCGAGCGTTCCCTCCCATCTCTCATT 

35 GGTGTTGATAGAAGGCAGTCTTTCTCCCAAGCTGGTGGATCTCCTGAAGC 
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ACGCTGCCTGGGCTCCAGCATCCTGTGCGGATTTCACGTTCTCTTTAGGGG 
ATGGATGTTGACACTGGGGTGTGGGCTCTGAGCCCACAGGAGTTTAAAAA 
ACCAAAAGAGCTCAGAGTGTCGAGAGAGACCCAATCACCGAGAATGACA 
AGGCAACCTGGGGTGGATGTGGATCTTGAAACTAATAAAAAGGGGTTTTC 
5 ACAGTGACAGCGACATTCTCTTCATAGGGCACAGCTGTCAGTCTATGGCT 
GATCCAGAGCGAGCATCCATGAATTCTGCATGTGCAGGGGTCACTCTAAT 
ACCTGATATGTTGGCATCATCTTTGTGCITGAGCCTTCCNCTCCCAAATGG 
GAATGAAATAAAGGCAAATTCCCNCCCCCCCCAAAAAAGGGGNAAAAAA 
AAAAAAAAAAAAAAAAAAAA 

10 

SEQ. ID. NO. 18: Mouse MCH1R genomic DNA 

Nucleic acid sequence start and stop codons, as well as intron borders, are 
highlighted: 

GGCGGTAGAGGAAGACCCTTrTCTGGACTGCGGGGCTCAAGCTCCGGACA 

15 AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTCCTCCCTCCTCCG 
GGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAGCCAAACTGCCTG 

20 GGAAACTTTATCTrACAGTrCTTGGTGATAAGATCTGCAGTCGGCTTTGCC 
TGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGACAGAAGGGGCA 
GGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAACAACAAGGGTTG 
GTGACAAGACGTGAGGCAGGCTTGAGGGK3AAAGCTTGCTGATGAGTCCCA 
AATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGGCTGGAGAAGCAA 

25 GCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGTGGGAGCCAGCAG 
AAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTCTGGCCTCGCACTG 
GGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGCAAAGGCTGCTGG 
GAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTTTCTGGTGAATTA 
AGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATTCTAGCTCTGAAG 

30 GAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGTGGAGAAATTCCC 
AAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCTATTGGGAAAGCC 
CAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACAGAAAACCGGGAG 
AGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACCCATGGTGACAGTC 
ACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTTGGCCATCTCTCTC 

35 TGAAAAGATGAGGCTGCAAATACACACACACACACACACACACACACAC 
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ACACACACACACACACACACACACACACACACACAAATGTCCTTCAAGCC 
11 1 llGACAAGGTTITCTGGTGGATCCCGGGGATATGAAGTraTrcrcAGC 
AGATATCTGGGAGTOTGACTCCTGGCCCTCIX3AGTAAATXK5ATGAAGCG 
AAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTAGAAAATCCTATAG 
5 GAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACAGAACTAACAATAG 
CCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAGACGTAGTGACCCC 
AAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAGGCCGACATGGCAG 
GTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTGCTCTrrCTATCCTC 
AGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATCAACATCATCATGC 

10 CTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTGGGAAACTCCACAG 
TCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTGGTGCAGCAACGTC 
CCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCTGCITITCCTGCTGG 
GCATGCCTTTCATGATCCACCAGCTCATGGGTAATGGTGTCTGGCACTTTG 
GGGAAACCATGTGCACCCTCATCACAGCCATGGACGCCAACAGTCAGTTC 

15 ACCAGCACCTACATCCTGACTGCTATGGCCATTGACCGCTACTTGGCCACC 
GTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCTCCATGGCCACCCTG 
GTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCATCACrCCTGTGTCG 
CTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATC 
CGCCTACCAAACCCAGATACTGATCTTTACTGGTrCACTCTGTATCAGTTT 

20 TTCCTGGCCITCGCCCTTCCGTrTGTGGTCATCACTGCTGCGTACGTGAAA 
ATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCAT 
CCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGG 
TCTTGTTrcTGTGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGT 
CCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCA 

25 GCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCT 
GTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAG 
GGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGA 
CAGAAAGCAAAGGCACCTGACAATCCCCCCCGGTCACCTCCAAGTCAGGT 
CACCGCATCAAACCATGGGGAGAGATACTGAGATAAACCCGGGGCTACC 

30 CTGGGAGGATGCAGAAGCTGGAGGCTGGGGGCTTGTAGCAAACCACATTC 
CACGGGGCCCACAAATTGCTAGGGAGGCTTGCAGCCTGGTTTGGGGGGGA 
AGCCTCAGACTGCAGGGATCCCCTTGACAGAATAGAAGCGGAGCAAGAA 
GGAAAGGGTGGTTTGACTGGTTCTCGGGGTCTGTATCTGTTGGCTCGCATA 
TATCTTrCTCTCAAGGGAAGAAGGCGGAGGTGCCTAGCTOGGTTCCTTTA 

35 AAACTAGGCAGGGCTAGGATCTGAGCAGCTAGGGCTCTACTGTGAGACTG 
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GGCAAGCCGAGCGTTCCCTCCCATCTCTCATTGGTGTTGATAGAAGGCAG 
TCrrTCTCCCAAGCTGGTGGATCTCCTGAAGCACGCTGCCTGGGCTCCAGC 
ATCCTGTGCGGATTTCACGTTCTCTTTAGGGGATGCATGTTGACACTGGGG 
TGTGGGCTCTGAGCCCACAGGAGTrTAAAAAACCAAAAGAGCTCAGAGTG 

5 TCGAGAGAGACCCAATCACCGAGAATGACAAGGCAACCTGGGGTGGATG 
TGGATCTTGAAACTAATAAAAAGGGGTTTTCACAGTGACAGCGACATTCT 
CTTCATAGGGCACAGCTGTCAGTCTATGGCTGATCCAGAGCGAGCATCCA 
TGAATTCTGCATGTGCAGGGGTCACTCTAATACCTGATATGTTGGCATCAT 
CTTTGTGCTTGAGCCTrCCNCTCCCAAATGGGAATGAAATAAAGGCAAAT 

10 TCCCNCCCCCCCCAAAAAAGGGGNAAAAAAAAAAAAAAAAAAAAAAAA 
AA 

SEQ. ID. NO. 19: Human short form/mouse species chimeric MCH1R 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 

15 CTCTGATGGCCCCGATAACCTCACTrCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 

20 CCAGCTCATGGGCAATGGGGTGTGGCACnTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 
ACCGICCATGGCCATTGACCGCTACCrCKX:CACTGTCCACCCCATCTCTrCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

25 CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACTGATCTITACrGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT 
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 

30 CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTT^ 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 

35 TGA 
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SEQ. ID. NO. 20: Human long form/mouse species chimeric MCH1R 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
5 GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCrc 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 
10 GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 

catgatccaccagctcatgggcaatggggtgtggcactttggggagacca 
tgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc 
15 tacatcctgaccgccatggccattgaccgctacctggccactgtccacccc 
atctcttccacgaagttccggaagk:cctccatggccaccctggtgatctgc 

CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC 

aggcttatccccttcccagggggtgctgtgggctgtggcatccgcctacca 

AACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT 
20 TCGCCCTTCCGTITGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC 
GCATGACGTCTrCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTrCGG 
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG 
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC 
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT 
25 GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT 
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG 
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA 
GGCACCTGA 

30 SEQ. ID. NO. 21: Aequorea victoria Green Fluorescent Protein (GFP) cDNA 
Nucleic acid sequence start and stop codons are highlighted: 
TACACACGAATAAAAGATAACAAAGATGAGTAAAGGAGAAGAACTTTTC 
ACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCAC 
AAATTTTCTGTCAGTGGAGAGGGTGAAGGTGATGCAACATACGGAAAACT 

35 TACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCATGGCCAAC 
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ACTrGTCACTACTTTCTCTTATGGTGTrCAATGCTTTrCAAGATACCCAGAT 
CATATGAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGT 
ACAGGAAAGAACTATATTTTTCAAAGATGACGGGAACTACAAGACACGTG 
CTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATAGAATCGAGTTAAAA 
5 GGTATTGATTTTAAAGAAGATGGAAACATTCITGGACACAAATTGGAATA 
CAACTATAACTCACACAATGTATACATCATGGCAGACAAACAAAAGAATG 
GAATCAAAGTTAACITCAAAATTAGACACAACATTGAAGATGGAAGCGTT 
CAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGT 

ccttttaccagacaaccattacctgtccacacaatcixk:cctttcgaaaga 
10 tcccaacgaaaagagagaccacatggtccttcttgagtttgtaacagctg 
ctgggattacacatggcatggatgaactatacaaataaatgtccagactt 

CCAATTGACACTAAAGTGTCCGAACAATTACTAAAATCTCAGGGTTCCTG 
GTTAAATTCAGGCTGAGATATTATTTATATATTTATAGATTCATTAAAATT 
GTATGAATAATTTATTGATGTTArrGATAGAGGrrATTTTCTTATTAAACA 
15 GGCTACTTGGAGTGTATTCTTAATTCTATATTAATTACAATITGATTTGACT 
TGCTCAAA 

SEQ. ID. NO. 22: EGFP + Linker 

Nucleic acid sequence start and stop codons are highlighted and a 12 amino acid 

20 linker sequence is denoted in lower case: 

gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC 
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA 
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC 

25 CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC 
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG 
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA 
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG 
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC 

30 TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 
AAGAAGGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG 
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC 
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT 
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG 
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TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGC 
GGCCGC 

SEQ. ID. NO. 23: Emerald 
5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG 
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTGACCT 
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA 
AGGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC 

15 AAGACCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACT 
ACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAAC 
CACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCG 
CGATCACATGGTCCTGCTGGAGTrcGTGACCGCCGCCGGGATCACTCTCG 
GCATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 24: Topaz 

atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggt 

cgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgag 

ggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcac 

25 caccggcaagctgcccgtgccctggcccaccctcgtgaccaccttcggct 
acggcgtgcagtgctrcgcccgctaccx:cgaccacatgcgccagcacgac 
ttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttc 
ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagg 
gcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggag 

30 gacggcaacatcctggggcacaagctggagtacaactacaacagccaca 
acgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttc 
aagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccacta 
ccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaacc 
actacctgagctaccagtccgccctgagcaaagaccccaacgagaagcgc 
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GATCACATGGTCCTGCTGOAGTTCGTGACCGCCGCCGGGATCACTCTCGG 
CATGGACGAGCTGTACAAGTAA 

SEQ.ID.N0.25:W1B 

5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 
CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG 
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT 
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACA 
ACGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTC 

15 AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA 
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC 
GATCACATGGTCCrGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG 
CATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 26: Mouse MCHlR-linker-EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, as well as intron borders, are highlighted and a 12 amino acid 
linker sequence is denoted in lower case: 

25 ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 
CTCCGATG<}CCAGGATAATTTCACATraGCG^TGAGTCGAGTTGGAGTC 
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA 
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 

30 CAGAAGCK3GCAG<3GAGGAATAGAGATGG<KK:AGAGGCACATTTAGAAAC 
AACAAGCK3TTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCCAAATATG<nTTGCAGGGG<3GG<3GGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC 

35 TGGCCTCGCACrGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC 
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AAAGGCTGCTGGGAGCAAAAGACITCACAGGGCGTGAGAGGATTAACTTT 
TCIXjGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAA 
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 
5 ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 
CATGGTGACAGTCACrTCTCACATATGCCTAGGAACCTATCTGAAACCTTT 
GGCCATCTCTCTCTGAAAAGATGACKXn'GCAAATACACACACACACACAC 
ACACACACACACACACACACACACACACACACACACACACACACACAAA 

10 TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA 
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA 
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 

15 ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC 
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG 
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG 

20 GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 
TGTCTGGCACnTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 

25 CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 
TCACTCCTGTGTGKjCIUTATGCCAGGCITATCCCCTTCCCAGGGG^ 
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 
CTCTGTATCAGTTTTTCCTGGCCTrCGCCCTTCCGTTTGTGGTCATCACTGC 
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT 

30 CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG 
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG 
TACATAGTACTCnXjTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA 

35 GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
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ACGAGGAGAGGACAGAAAGCAAAGGCACCgtcgacggtaccgcgggcccgggatccatcg 
ccaccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCC 
TGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGC 
GAGGGCGAGGGCGATGCCACCrACGGCAAGCTGACCCTGAAGTTCATCTG 

5 CACCACCGGCAAGCTGCCCGTC3CCCTGGCCCACCCTCGTGACCACCCTGA 
CCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCAC 
GACTTCTrCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCAT 
CTTCTrCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG 
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAG 

10 GAGGACGGCAACATCCTCGGGCACAAGCTGGAGTACAACTACAACAGCe 
ACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA 
CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC 
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGAC 
AACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAA 

15 GCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTC 

TCGGCATGGACGAGCTGTACAAGTAA 

SEQ. ED. NO. 27: Mouse MCH1R/EGFP direct fusion 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 

20 EGFP, respectively, as well as intron borders, are highlighted: 

ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 

CrcCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC 

CrcCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 

CCAAACTGCCTGGGAAA(mTATCTTACAGTTCTTGGTGATAAGATCTGCA 

25 GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC 
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 

30 GGGAGCCAGCAGAAGCGCTITGATTCACCGCTATCCTGGGCTCAATCCTC 
TGGCCTCGCACTGGGGAAATGGGGT^AGTGGTCCTTGCrGTCiTCTGGC 
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT 
TCTGGTGAATTAAGCTrCTrGACATTTGCAGAACGTCAATGCCTTAAAATT 
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 

35 GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 
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ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 
GAAAACCGCK3AGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 
CATGGTGACAGTCACTrCTCACATATGCCTACKSAACCTATCIGAAACCm 
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC 
5 ACACACACACACACACACACACACACACACACACACACACACACACAAA 
TCTCCTTCAAGCCTITITGACAAGGTITrcTGGTG^ 

AGTTGTTCTCAGCAGATATCrGGGAGTCTrcACTCCTGGCCCTCTGAGTAA 

ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 

GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 

10 GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 
CTCTlTCrATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC 
AACATCATCATGCCTTCAGTGTITGGTACCATCTGTCrCCTGGGCATTGTG 

15 GGAAACTCCACAGTCATTTITGCCGTGGTGAAGAAATCCAAGCTGCACTG 
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 
TGTCTGGCACTrTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 
CCAACAGTCAGTrCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 

20 GCTACTIGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG 
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 
CTCTGTATCAGTTTTTCCTGGCCTTCGC 

25 TGCGTACGTGAAAATACTACAGCGCATGACGTCTrCGGTGGCCCCAGCCT 
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG 
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTrTGTG 

30 TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA 
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
ACGAGGAGAGGACAGAAAGCAAAGGCACCATGGTGAGCAAGGGCGAGG 
AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA 
AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT 

35 ACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGGAAGCTGCCCGTG 
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ccctckk:ccaccctcgtoaccaccctgacctacggcgtgcagtgcttcag 

CCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGC 
CCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAAC 
TACAAGACCCGCGCCGAGGTGAAGTrCGAGGGCGACACCCTGGTGAACC 

5 GCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG 
GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCG 
ACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT 
CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAAGACCCCC 
ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCA 

10 GTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGC 
TGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC 

AAGTAA 

SEQ. ID. NO. 28: Human short form/mouse species chimeric MCHlR-linker- 
15 EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
lower case: 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 

20 CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATGTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCrCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCrCTCGGTAGTAGATCTCCTCmCTCCTGGGCATGCCCTTCATGATCCA 

25 CCAGCrcATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 
ACCGCCATGGCCATTGACCGCrACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

30 CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACTGATCTrTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT 
TCCGTITGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCrGTCTGGTCTTCTTTGTGTGCTGGG 

35 CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
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CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGT^ 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGOT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 
5 gtcgacggtaa;gcgggcccgggatccatcgccaccATGGTGAGGAAGGGCGAGGAGCTGTT 

caccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggcc 
acaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaa 
gctcaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggc 
ccaccxh'cgtgaccaccctgacctacggcgtgcagtgcttcagccgctac 

10 cccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaagg 
ctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaaga 
cccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgag 
ctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagc 
tggagtacaactacaacagccacaacgtctatatcatggccgacaagcag 

15 aagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacg 
gcagcgtgcagctcgccgaccactaccagcagaacacccccatcggcgac 
ggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccct 
gagcaaagaccccaacgagaagcck:gatcacatggtcctgctggagttcg 
tgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa 

20 

SEQ. ID. NO. 29: Human long form/mouse species chimeric MCHlR-linker- 
EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
25 lowercase: 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 

30 CACIGGCTGGATGGACCrGGAAGCCrCGCrGCrGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTrcGGTGTrc 
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 

35 CATCATCAACCTCrCGGTAGTAGATCrCCTCTTTCrCCTGGGCATGCCCTT 

-31- 



WO 01/68706 



PCT/US01/08071 



CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACnTGGGGAGACCA 
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC 
ATCTCrrCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC 
5 CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC 
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA 
AACCCAGATACTGATUmACTGGTTCACTCTGTATCAGTITrTCCTGGCCT 
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC 
GCATGACGTCTTGGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG 
10 ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG 
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC 
CCGACCCTCACATrCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT 
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT 
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG 
15 CACGGTCAGGAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA 
GGCACCgtogacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGA 
GCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAA 
ACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAC 
GGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCC 
20 CTGGCCCACGCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTrCAGCC 
GCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC 
GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA 
CAAGACCCGCGCCGAGGTGAAGTrCGAGGGCGACACCCTGGTGAACCGC 
ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGC 
25 ACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC 
AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCG 
AGGACGGCAGCGIGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC 
GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTC 
CGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGG 
30 AGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAG 

TAA 

Exam ple 2: Gener^™ nf Oiimftric and Fusion Proteins 

DNA vectors encoding fusion proteins between a MCH-R receptor 
35 (MCH1R) and several different superbright variants of Green Fluorescent Protein 
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(GFP) were generated. GFP variants were fused either via a 12 amino acid linker 
TCGACGGTACCGCGGGCCCGGGATCCATCGCCACC (SEQ. ID. NO. 30), 
amino acid sequence: VDGTAGPGSIAT (SEQ. ID. NO. 31) (linker fusions) or 



Green Fluorescent Protein (EGFP). MCH1R was PCR-amplified (95°C for 5 
minutes; 95°C for 30 seconds, 60°C for 45 seconds, 68°C for 3.5 minutes, for 15 

10 cycles; 68°C for 7 minutes) from a full-length mouse MCH1R genomic DNA lambda 
clone utilizing a high fidelity polymerase mix (Expand High Fidelity PCR System 
from Boehringer Mannheim) and PCR primers [MCH1R (Eco RI) 5*: 
GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32), MCH1R (Sal I) 
3': GCGTCGACGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 33)] that generated 

15 Eco RI and Sal I enzymatic restriction sites at the N- and C-terminus, respectively. 
The MCH1R N-terminal PCR primer was also designed to introduce a Kozak 
consensus sequence for translation which contained an Nco I site (5'-ACCATGG-3'), 
and the MCH1R C-terminal PCR primer was also designed to eliminate the 
endogenous stop codon present in the mouse MCH1R gene. The resulting PCR 

20 product was phenol/chloroform extracted, restriction digested with Eco RI and Sal I, 
gel purified, and subcloned in frame into the multicloning site of Clontech's pEGFP- 
N3 vector between Eco RI and Sal I sites. Several resulting clones for this construct 
were sequenced to identify a clone with an entirely correct nucleotide sequence. This 
clone was named mMCHlR-l-EGFP for mouse MCHlR-linker-EGFP. 

25 An approximately 760 bp Sal I to Not I fragment of mMCHlR-l-EGFP 

was excised, gel purified, and subcloned into the multicloning site of pBluescript 
(SK+) (Stratagene) between Sail and Not I sites. An approximately 710 bp Nco I to 
Bsr Gl fragment of EGFP was excised from the resulting pBluescript-EGFP vector 
and replaced with the corresponding Nco I to Bsr Gl fragment of either Emerald, 

30 Topaz, or W1B (other superbright GFP variants), which were excised from vectors 
pRSET-Emerald, pRSET-Topaz, and pRSET-WlB, respectively. pRSET-Emerald, 
pRSET-Topaz, and pRSET-WlB were obtained from Aurora Biosciences Co. Sal I to 
Not I fragments containing either Emerald, Topaz, or W1B were excised from the 
resulting pBluescript-Emerald, pBluescript-Topaz, and pBluescript-WlB vectors, 

35 respectively. Appropriate fragments were gel purified and subcloned into mMCHlR- 



directly to the C-terminus of MCH1R (direct fusions). 



5 



Mouse MCH1 R-linker-GFP Variant Fusion Constructs 
Initially, mouse MCH1R was fused in frame via the linker to Enhanced 
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1-EGFP digested with Sal I and Not I, replacing the Sal I to Not I EGFP fragment with 
the corresponding Sal I to Not I fragment from either Emerald, Topaz, or W1B. 
Several clones for each construct were sequenced to confirm the presence of the 
appropriate GFP variant The resulting vectors were named mMCHlR-l-Emerald, 
5 mMCHlR-l-Topaz, and mMCHlR-l-WlB for mouse MCHlR-linker-Emerald, 
mouse MCHlR-linker-Topaz, and mouse MCHlR-linker-WlB, respectively. 

Mouse MCH1R/GFP Variant Direct Fusion Constructs 

A two step PCR strategy was employed to generate the direct fusion 

10 constructs. First, mouse MCH1R, EGFP, and Emerald were PCR-amplified from a 
full-length mouse MCH1R genomic DNA lambda clone, Clontech's pEGFP-N3 
vector, and Aurora's pRSET-Emerald vector, respectively. Mouse MCH1R was 
PCR-amplified according to the previously mentioned conditions utilizing the same 
N-terminal PCR primer [MCH1R (Eco RI) 5' : GCGAATTCACCATGGATCTGCA 

15 AGCCTCG (SEQ. ID. NO. 32)], but in this case a different C-terminal PCR primer 
was employed. The C-terminal PCR primer [MCH1R (EGFP/Emerald) 3': 
CCTTGCTCACCATGGTGCCTITC (SEQ. ID. NO. 34)] eliminated 

the endogenous stop codon of mouse MCH1R as before and introduced a region of 
nucleotide sequence complementary to the nucleotide sequence of the N-terminus of 

20 EGFP. 

EGFP and Emerald were PCR-amplified (95°C for 5 minutes; 95°C for 
30 seconds, 60°C for 45 seconds, 68°C for 1.5 minutes, for 15 cycles; 68°C for 7 
minutes) separately with a high fidelity polymerase mix (Advantage HF-2 from 
Clontech) from their respective templates utilizing a common N-tenninal PCR primer 

25 [EGFP/Emerald (MCH1R) 5': CAGAAAGCAAAGGCACCATGGTGAGCAA 
GGGCGAGGAGC (SEQ. ID. NO. 35)] that generated a region of nucleotide 
sequence complementary to the C-terminus of mouse MCH1R and C-terminal PCR 
primers [EGFP 3': GGCGGATCCTCTAGAGTCGCGGCC (SEQ. ID. NO. 36), or 
Emerald (EGFP) 3': GCTCTAGAGTCGCGGCCGCTTACITGTACAGCTCGTCC 

30 (SEQ. ID. NO. 37)] that generated a Not I site at the C-terminus. The resulting PCR 
products were electrophoresed on an agarose gel and the appropriate fragments were 
gel purified. 

In a second PCR step, PCR reactions were set up between the 
previously generated mouse MCH1R and EGFP, or mouse MCH1R and Emerald 
35 PCR products. Following an initial 5 minute denaturation step at 95°C, two rounds of 
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thermocycling (95°C for 30 seconds, 60°C for 45 seconds, 68°C for 4 minutes) were 
performed in the absence of PCR primers. This allowed the mouse MCH1R and GFP 
variants to anneal at their complementary regions and to be filled in by the high 
fidelity polymerase mix (Expand High Fidelity PCR System from Boehringer 
5 Mannheim), yielding double stranded template DNA. 

Subsequently, the common N-terminal mouse MCH1R [MCH1R (Eco 
RI) 5': GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32)] and 
appropriate C-tenninal PCR primers [EGFP 3':* GGCGGATCCTCTAGAGTC 
GCGGCC (SEQ. ID. NO. 36) or Emerald (EGEP) 3': GCTCTAGAGTCGCGG 

10 CCGCTTACITGTACAGCTCGTCC (SEQ. ID. NO. 37)] were added to the 

reactions and thermocycling was continued for an additional fifteen cycles followed 
by a final extension at 68°C for 7 minutes. The resulting PCR products were 
phenol/chloroform extracted, restriction digested with Eco RI and Not I, 
electrophoresed on an agarose gel, and appropriate fragments were gel purified. 

15 These Eco RI to Not I fragments represent direct fusions between 

either mouse MCH1R and EGFP, or mouse MCH1R and Emerald. Clontech's 
pEGFP-N3 vector was restriction digested with Eco RI and Not I liberating an 
approximately 780 bp Eco RI to Not I EGFP fragment. This restriction digest was 
electrophoresed on an agarose gel and the approximately 3.9 Kb pEGFP-N3 vector 

20 backbone was gel purified. Eco RI to Not I mouse MCH1R/EGFP or mouse 

MCHIR/Emerald direct fusion fragments were subcloned into the pEGFP-N3 vector 
backbone between Eco RI and Not I sites. Several resulting clones for each of these 
two constructs were sequenced to identify clones with correct nucleotide sequence; 
however, no clones with entirely correct nucleotide sequences ware identified 

25 Fortunately, several clones for each of the two constructs only had nucleotide 

mismatches in the intron region of mouse MCH1R, and therefore, were not expected 
to effect the functionality of the resulting fusion proteins. These clones were named 
mMCHlR/EGFP and mMCHlR/Emerald for mouse MCH1R/EGFP direct fusion and 
mouse MCHIR/Emerald direct fusion, respectively. 

30 

Human Short and Long Form/Mouse Species Chimeric 

MCHlR-linker-GFP Variant Fusion Constructs 
The initial mouse MCHlR-linker-GFP variant fusion constructs were 
modified to generate both human short form and human long form/mouse species 
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chimeric MCHlR-linker-GFP variant fusion constructs. An approximately 1.7 kb 
Hind IE to Bsp EI fragment of the mouse MCH1R gene containing exon 1, the intron, 
and 127 amino acids of exon 2 was excised from the various mouse MCHlR-linker- 
GFP variant fusion constructs and replaced by either an approximately 470 bp Hind 
5 m to Bsp EI fragment from the wild-type human MCH1R short form or an 

approximately 670 bp Hind HI to Bsp EI fragment from the wild-type human MCH1R 
long form. 

Several clones for each construct were sequenced to confirm the 
presence of the N-terminal region of either the human MCH1R short or long forms. 
10 These clones were named hshort/mMCHlR-l-GFP variant or hlong/mMCHlR-l-GFP 
variant for human short form/mouse species chimeric MCHlR-linker-GFP variant 
and human long form/mouse species chimeric MCHlR-linker-GFP variant, 
respectively. 

15 F.**mple 3: Functional Evaluation of MCH1R/GFP V ariant Fusion Proteins 

Both HEK293 Aequorin (National Institutes of Health) and CHO 
mammalian cell lines were transiently transfected with the various MCH1R/GFP 
variant fusion constructs, as well as the appropriate control constructs. Transfection 
was performed using Lipofectamine 2000 (Gibco BRL) per the manufacturer 

20 recommended protocol. Approximately 48 hours after transfection cells were 

harvested, stimulated with various concentrations of human MCH, and assayed for 
either aequorin bioluminescence (HEK293 Aequorin cells) or cAMP production 
(CHO cells). Aequorin bioluminescence is a representative measure of intracellular 
Ca 2+ mobilization. cAMP production was measured with the Adenylyl Cyclase 

25 Activation FlashPlate Assay (NEN life Science Products, tic). 

Following transient transfection of the mMCHlR-linker-EGFP 
construct (MCH-R-1-EGFP) into HEK293 Aequorin cells, the resulting fusion protein 
exhibited functional activity comparable to that of the wild-type human MCH1R short 
form (MCH-R wt). By this functional assay, the EC50 value for mMCHlR-l-EGFP 

30 was nearly identical to that of the wild-type human short fonn receptor (Figure 1). 

Following transient transfections of the mMCHlR-l-EGFP and 
mMCHlRyEGFP fusion constructs into CHO cells, the resulting fusion proteins 
exhibited functional activity comparable to that of the wild-type human MCH1R short 
form. By this functional assay, the EC50 values for mMCHlR-l-EGFP and 

35 mMCHlR/EGFP were comparable to that of the wild-type human receptor (Table 1). 
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Transient transfections with the corresponding Emerald constructs yielded similar 
results (data not shown). 



Table 1 

5 



Receptor 


EC5o(nM) 


Wild-type Human MCH1R Short Form 


2.166 


Mouse MCH1R/EGFP 


0.819 


Mouse MCH1R-1-EGFP 


3.199 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGEP (HuShort/mMCHlR-l-EGFP) and human long 
form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHlR-l-EGFP) 
10 constructs into HEK293 cells, the resulting fusion proteins exhibited functional 
activity comparable to that of the wild-type human MHC1R short and long forms, 
respectively. By this functional assay, the EC50 value for each fusion proteins was 

nearly identical to that of the corresponding wild-type human receptor (Table 2). 
15 Table 2 



Receptor 


EC50 (nM) 


Wad-type Human MCH1R Short Form 


22.27 


HuShort/mMCHlR-l-EGFP 


19.54 


Wild-type Human MCH1R Long Form 


196.7 


HuLong/mMCHlR-l-EGFP Form 


217.5 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGFP (HuShort/mMCHlR-l-EGFP) and human long 
20 form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHlR-l-EGFP) 

constructs into CHO cells, the resulting fusion proteins exhibited functional activity 
comparable to or less than that of the wild-type human MHC1R short and long forms, 
respectively (Table 3). By this functional assay, the EC50 value for the human short 

form/mouse species chimeric MCH1R-1-EGFP fusion protein was comparable to that 
25 of the corresponding wild-type human receptor, whereas, the human long form/mouse 
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species chimeric MCH1R-1-EGFP fusion protein had an EC50 value approximately 
7.5-fold higher than that of its corresponding wild-type control. 



Table 3 

5 . 



Receptor 


EC50 (nM) 


Wild-type Human MCH1R Short Form 


1.029 


Wild-type Human MCH1R Long Form 


1.515 


HuShort/mMCHlR-l-EFGP 


1.565 


HuLong/mMCHlR-l-EGFP 


11.580 



Transient expression of all the MCH1R/GFP variant fusion proteins 
that underwent functional evaluation resulted in fluorescence primarily associated 
with the plasma membrane in both HEK293 and CHO cells (data not shown). This 
10 pattern of fluorescence is consistent with a predominant membrane associated 
localization. 



Example 4: Generation of Stable Cell lines 

Wild-type CHO cells were transfected using SuperFect (Qiagen) and 

15 either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R- 
EGFP. Forty-eight hours after transfection, transfected cells were subjected to 
positive selection for approximately ten days in media containing G418. Following 
selection, MCH-1R-EGFP expressing CHO cells were bulk sorted by Fluorescence 
Assisted Cell Sorting (FACS) for one or two rounds on the basis of fluorescence 

20 intensity to increase the population of cells expressing EGFP. Following bulk sorts, 
individual clones of varying fluorescence intensities were isolated by FACS and 
expanded 

Fluorometric Microvolume Assay Technology (FMAT) was initially 
employed to screen a large number of stable clones by whole cell binding with a 
25 fluorescently labeled MCH derivative (SymJz-MCH, PE Biosystems) to identify those 
clones with good specific binding windows. Several clones exhibiting specific 
binding windows greater than 3-fold were further evaluated for MCH binding with the 
SPA-based Binding Assay. Cells from individual clones were dissociated in enzyme 
free dissociation media and cell membranes were prepared and subsequently tested for 
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their ability to bind [ 125 I]Phe 13 Tyr 19 -MCH in the presence of human MCEL CHO cell 
lines expressing either mouse MCH-1R-EGFP or human short/mouse species 
chimeric MCH-1R-EGFP (Figure 4) displayed IC50 values with MCH that were 
indistinguishable from the corresponding IC50 values obtained with a CHO cell line 
5 expressing the wild-type human short isof orm of MCH-1R. 

The functional activity of these clones was evaluated with the cAMP 
Flashplate Assay (Figures 2 and 3). CHO cell lines expressing either mouse MCH- 
1R-EGFP (Figure 2) or human short/mouse species chimeric MCH-1R-EGFP (Figure 
3) displayed EC50 values with human MCH that were indistinguishable from the 

10 EC50 value obtained with a CHO cell line expressing the wild-type human short 
isoformofMCH-lR. 

The subcellular localization of the MCH-1R-EGFP fusion proteins 
were determined by confocal microscopy utilizing EGFP fluorescence as a marker for 
MCH-1R expression. CHO cell lines stably expressing either mouse MCH-1R-EGFP 

15 or human short/mouse species chimeric MCH-1R-EGFP displayed EGFP 

fluorescence primarily associated with the plasma membrane, demonstrating that 
these MCH-1R-EGFP fusion proteins are primarily associated with the plasma 
membrane. 

Other embodiments are within the following claims. While several 
20 embodiments have been shown and described, various modifications may be made 
without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED IS : 



1 . A fusion protein comprising: * 

a) a melanin concentrating hormone receptor polypeptide 
5 region comprising a sequence selected from the group consisting of: SEQ. ID. NO. 1, 

SEQ. ID. NO. 2, SEQ. SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. NO. 5; and 

b) a fluorescent polypeptide region joined directly, or 
though a linker, to the carboxy side of said melanin concentrating hormone receptor 
polypeptide region. 

10 

2. The protein of claim 1 , wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of SEQ. 
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 

15 3. The protein of claim 2, wherein said melanin concentrating 

hormone polypeptide region consists of a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 
NO. 5. 

20 4. The protein of claim 3, wherein said protein consists essentially 

of said melanin concentrating hormone receptor polypeptide region and said 
fluorescent polypeptide region. 

5 . The protein of claim 4, wherein said protein consists of the 
25 amino acid sequence of SEQ. ID. NO. 1 1 or SEQ. ID. NO. 12. 

6. The protein of claim 1, wherein said melanin concentrating 
hormone polypeptide region is a chimeric polypeptide comprising (a) an MCH 
binding region from a first species and (b) a transmembrane and intracellular domain 

30 region from a second species joined directiy, or though a linker, to the carboxy side of 
said MCH binding region. 



7. The protein of claim 6, wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of: SEQ. 
35 ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 



-40- 



WO 01/68706 



PCT/US01/08071 



8. Hie protein of claim 7, wherein said protein consists of the 
amino acid sequence of SEQ. ID. NO. 13 or SEQ. ID. NO. 14. 

5 9. A chimeric melanin concentrating hormone protein comprising: 

a) a melanin concentrating hormone binding region 
characteristic of a human melanin concentrating hormone receptor, 

b) a transmembrane domain characteristic of a non-human 
melanin concentrating hormone receptor; and 

10 c) an intracellular domain characteristic of a non-human 

melanin concentrating hormone receptor. 

10. The protein of claim 9, wherein said protein comprises a 
melanin concentrating hormone receptor polypeptide having a sequence similarity of 

15 at least 75% with either SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

1 1 . The protein of claim 10, wherein said protein comprises the 
sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

20 12. The protein of claim 1 1 , wherein said protein consists of the 

sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

13. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 1. 

25 

14. The nucleic acid of claim 13, wherein said nucleotide sequence 
is a contiguous sequence. 

15. The nucleic acid of claim 13, wherein said nucleotide sequence 
30 is selected from the group consisting of SEQ. ID. NO. 26, SEQ. ID. NO. 27, SEQ. ID. 

NO. 28 and SEQ. ID. NO. 29. 

16. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 9. 

35 



-41- 



WO 01/68706 



PCT/US01/08071 



17. The nucleic acid of claim 16, wherein said nucleotide sequence 
is a contiguous sequence. 

18. The nucleic acid of claim 16, wherein said nucleotide sequence 
5 is selected from the group consisting of SEQ. ED. NO. 19 and SEQ. ID. NO. 20. 

19. An expression vector comprising the nucleic acid of claim 13. 

20. An expression vector comprising the nucleic acid of claim 16. 

21. A recombinant cell comprising the nucleic acid of claim 13. 



10 



22. The recombinant cell of claim 21, wherein said nucleic acid is 
present in an expression vector. 

15 

23. The recombinant cell of claim 21, wherein said nucleic acid is 
present in the genome of said cell. 

24. A recombinant cell comprising the nucleic acid of claim 16. 

20 

25. The recombinant cell of claim 24, wherein said nucleic acid is 
present in an expression vector. 

26. The recombinant cell of claim 24, wherein said nucleic acid is 
25 present in the genome of said cell. 

27. A non-human transgenic animal comprising the nucleic acid of 

claim 13. 

30 28. A non-human transgenic animal comprising the nucleic acid of 

claim 16. 

29. A method for assaying for melanin concentrating hormone 
receptor active compounds comprising the steps of: 
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a) contacting the cell of claim 21 with a test preparation 
comprising one or more test compounds; and 

b) measuring the effect of said test preparation on one or 
more melanin concentrating hormone receptor activities. 

5 

30. A method for assaying for melanin concentrating hormone 
receptor active compounds comprising the steps of: 

a) contacting the cell of claim 24 with a test preparation 
comprising one or more test compounds; and 
10 b) measuring the effect of said test preparation on one or 

more melanin concentrating hormone receptor activities. 
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SEQUENCE LISTING 

<110> Merck & Co., inc. 

<1 pS^m^^L concentrating hormone receptor 

CHIMERIC AND FUSION PROTEINS 

<130> 20652 PCT 

<150> 60/189,698 
<151> 2000-03-15 

<160> 37 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 422 
<212> PRT / 
<213> Human 

<400> 1 

Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu 
Gly Gly Gly Ser Gly Cys Gin Ala Thr Gl°u Glu Asp Pro Leu Pro Asn 
Cys Gly Ala Cys Ala Pro Gly Gin Gly Gly Arg Arg Trp Irg Leu Pro 
Gin Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu £rp Glu Gin Ala 
Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly 
Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Hp Asn Leu Thr Ser 2a 
Gly Ser Pro Pro Arg Thr Gly Ser lie llr Tyr He Asn lie Ue Met 
Pro Ser Val Phe Gly Thr lie Cys Leu Leu Gly lie lie oly Asn Ser 
Thr val lie Phe Ala Val Val Lys Lys Ser Lys Leu S Trp Cys Asn 



Asn val Pro Asp lie Phe lie lie Asn Leu Ser Val Val Asp Leu Leu 

Phe Leu Leu Gly Met Pro Phe Met He His g" Leu Met Gly Asn oly 

Val Trp His Phe Gly Glu Ofcr Met Cys Sr Leu lie Thr Ala Met Asp 

Ala Asn ser Gin Phe l*r Ser Thr Tyr He Leu Thr Ala Si Ala He 

^00 DOR 

Asp Arg Tyr Leu Ala Thr Val His Pro lie Ser Ser Thr Lys Phe Arg 
Lys Pro Ser Val Ala Thr Leu Val lie Cys Leu Leu Trp Ala Leu Ser 
Phe lie ser He Thr Pro Val Trp Leu Tyr Ma Arg Leu He Pro Pne 
Pro Gly Gly Ala Val Gly Cys Gly lie Arg Leu Pro Asn Pro Asp Thr 
Asp Leu Tyr Trp Phe Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu 
Pro Phe Val Val He Thr Ala Ala Tyr Val Arg He Leu Gin Arg Met 



300 
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Thr Ser Ser Val Ala Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr 
305 310 315 320 

Lys Arg Val Thr Arg Thr Ala He Ala He Cys Leu Val Phe Phe Val 

325 330 335 

Cys Trp Ala Pro Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser 

340 345 350 

Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu 

355 360 365 

Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys 

370 375 380 

Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin 
385 390 395 400 

Gly Gin Leu Arg Ala Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg 

405 410 415 

Thr Glu Ser Lys Gly Thr 



<210> 2 
<211> 353 
<212> PRT 
<213> Human 

<400> 2 

Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

1 5 10 15 

Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

20 25 30 

Thr Gly Ser He Ser Tyr He Asn He He Met Pro Ser Val Phe Gly 

35 40 45 

Thr He Cys Leu Leu Gly He He Gly Asn Ser Thr Val He Phe Ala 

50 55 60 

Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp He 
65 70 75 80 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 95 

Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

100 105 HO 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

115 120 125 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

130 135 140 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Val Ala 
145 150 155 160 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 

165 170 175 

Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala Val 

180 185 190 

Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

195 200 205 

Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 

210 215 220 

Thr Ala Ala Tyr Val Arg He Leu Gin Arg Met Thr Ser Ser Val Ala 
225 " 230 235 240 

Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 255 

Thr Ala He Ala He Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 

260 265 270 

Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu Thr 

275 280 285 

Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 



420 



290 



295 



300 
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Cys Leu Asn 


jfro 


rile 


val 


Tyr 


lie 


Val 


305 








310 








Arg Leu Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 








325 










Val Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 






340 










345 


Thr 
















<210> 3 
















<211> 353 














<212> PRT 














<213> Mouse 














<400> 3 
















Met Asp 


Leu 


Gin 


Ala 


Ser 


Leu 


Leu 


Ser 


1 






5 










He Ser 


Asp 


Gly 


Gin 


Asp 


Asn 


Phe 


Thr 






20 










25 


Thr Arg 


Ser 


Val 


Ser 


Tyr 


He 


Asn 


He 




35 










40 




Thr He 


Cys 


Leu 


Leu 


Gly 


He 


Val 


Gly 


50 










55 






Val Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


Trp 


65 








70 








Phe He 


He 


Asn 


Leu 


Ser 


Val 


Val 


Asp 








85 










Pro Phe 


Met 


He 


His 


Gin 


Leu 


Met 


Gly 






100 










105 


Glu Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala 




115 








120 




Thr Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 


Met 


130 










135 






Thr Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys 


145 








150 








Thr Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 


Ala 








165 










Pro Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 


He 






180 










185 


Gly Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro 


195 










200 




Thr Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


Phe 


210 








215 






Thr Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


Gin 


225 








230 








Pro Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 


Leu 








245 










Thr Ala 


He 


Ala 


He 


Cys 


Leu 


Val 


Phe 






260 










265 


Tyr Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser 


275 










280 




Phe Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He 


290 










295 






Cys Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val 


305 








310 








Arg Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala 








325 










Val Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu 



340 345 



Thr 
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Leu Cys 


Glu Thr Phe 


Arg 


Lys 


315 






320 


Ala Gin 


Gly Gin Leu 


Arg 


Ala 


330 


335 




Glu Arg 


Thr Glu Ser 


Lys 


Gly 




350 






Thr Gly 


Pro Asn Ala 


Ser 


Asn 


10 




15 




Leu Ala 


Gly Pro Pro 


Pro 


Arg 




30 






He Met 


Pro Ser Val 


Phe 


Gly 




45 . 






Asn Ser 


Thr Val He 


Phe 


Ala 




60 






Cys Ser 


Asn Val Pro 


Asp 


He 


75 




80 


Leu Leu 


Phe Leu Leu 


Gly 


Met 


90 




95 




Asn Gly 


Val Trp His 


Phe 


Gly 




110 






Met Asp 


Ala Asn Ser 


Gin 


Phe 


125 






Ala He 


Asp Arg Tyr 


Leu 


Ala 




140 






Phe Arg 


Lys Pro Ser 


Met 


Ala 


155 






160 


Leu Ser 


Phe He Ser 


He 


Thr 


170 




175 




Pro Phe 


Pro Gly Gly 


Ala 


Val 




190 






Asp Thr 


Asp Leu Tyr 


Trp 


Phe 




205 






Ala Leu 


Pro Phe Val 


Val 


He 




220 






Arg Met 


Thr Ser Ser 


Val 


Ala 


235 






240 


Arg Thr 


Lys Arg Val 


Thr 


Arg 


250 




255 




Phe Val 


Cys Trp Ala 


Pro 


Tyr 




270 






He Ser 


Arg Pro Thr 


Leu 


Thr 




285 






Ser Leu 


Gly Tyr Ala 


Asn 


Ser 




300 






Leu Cys 


Glu Thr Phe 


Arg 


Lys 


315 






320 


Ala Gin 


Gly Gin Leu 


Arg 


Thr 


330 




335 




Glu Arg 


Thr Glu Ser 


Lys 


Gly 




350 
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<210> 4 
<211> 353 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric MCH1R 
<400> 4 

Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

1 5 10 15 

Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

20 25 30 

Thr Gly Ser lie Ser Tyr lie Asn lie He Met Pro Ser Val Phe Gly 

35 40 45 

Thr He Cys Leu Leu Gly He He Gly Asn Ser Thr Val He Phe Ala 

50 55 60 

Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp He 
65 70 75 80 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 95 

Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

100 105 110 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

115 120 125 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

130 ^ 135 140 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 
145 150 155 160 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 

165 170 175 

Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala Val 

180 185 190 

Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

195 200 205 

Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 

210 ■ 215 220 

Thr Ala Ala Tyr Val Lys He Leu Gin Arg Met Thr Ser Ser Val Ala 
225 230 235 240 

Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 255 

Thr Ala He Ala lie Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 

260 265 270 

Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu Thr 

275 280 285 

Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 

290 295 300 

Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys Glu Thr Phe Arg Lys 
305 310 315 320 

Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin Gly Gin Leu Arg Thr 

325 330 335 

Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 
340 345 350 

Thr 



<210> 5 
<211> 422 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Human long form/mouse species 



<400> 5 



Met 


Ser Val 


Gly 


Ala 


Met 


Lys Lys 


Gly 


1 








5 








Gly 


Gly Gly 


Ser 


Gly 


Cys 


Gin Ala 


Thr 








20 








25 


Cys 


Gly Ala 


Cys 


Ala 


Pro 


Gly Gin 


Gly 






35 








40 




Gin 


Pro 


Ala 


Trp 


Val 


Glu 


Gly Ser 


Ser 




50 










55 




Thr 


Gly Thr 


Gly 


Trp 


Met 


Asp Leu 


Glu 


65 










70 






Pro 


Asn Ala 


Ser 


Asn 


Thr 


Ser Asp 


Gly 










85 








Gly 


Ser 


Pro 


Pro 


Arg 


Thr 


Gly Ser 


He 








100 








105 


Pro 


Ser Val 


Phe 


Gly 


Thr 


He Cys 


Leu 






115 








120 




Thr 


Val 


He 


Phe 


Ala 


Val 


Val Lys 


Lys 




130 










135 




Asn 


Val 


Pro 


Asp 


He 


Phe 


He He 


Asn 


145 








150 






Phe 


Leu 


Leu 


Gly 


Met 


Pro 


Phe Met 


He 










165 








Val 


Trp His 


Phe 


Gly 


Glu 


Thr Met 


Cys 








180 








185 


Ala 


Asn Ser 


Gin 


Phe 


Thr 


Ser Thr 


Tyr 






195 








200 


Asp 


Arg 


Tyr 


Leu 


Ala 


Thr 


Val His 


Pro 




210 










215 




Lys 


Pro 


Ser 


Met 


Ala 


Thr 


Leu Val 


lie 


225 










230 






Phe 


He 


Ser 


He 


Thr 


Pro 


Val Trp 


Leu 










245 








Pro 


Gly 


Gly 


Ala 


Val 


Gly 


Cys Gly 


He 








260 








265 


Asp 


Leu 


Tyr 


Trp 


Phe 


Thr 


Leu Tyr 


Gin 






275 








280 




Pro 


Phe 


Val 


Val 


He 


Thr 


Ala Ala 


Tyr 




290 










295 


Thr 


Ser 


Ser 


Val 


Ala 


Pro 


Ala Ser 


Gin 


305 










310 






Lys 


Arg 


Val 


Thr 


Arg 


Thr 


Ala He 


Ala 










325 








Cys 


Trp 


Ala 


Pro 


Tyr 


Tyr 


Val Leu 


Gin 








340 








345 


Arg 


Pro 


Thr 


Leu 


Thr 


Phe 


Val Tyr 


Leu 




355 








360 




Gly 


Tyr 


Ala 


Asn 


Ser 


Cys 


Leu Asn 


Pro 




370 










375 




Glu 


Thr 


Phe 


Arg 


Lys 


Arg 


Leu Val 


Leu 


385 










390 






Gly 


Gin 


Leu 


Arg 


Thr 


Val 


Ser Asn 


Ala 










405 








Thr 


Glu 


Ser 


Lys 


Gly 


Thr 







420 



<210> 6 
<211> 238 
<212> PRT 

<213> Aequorea Victoria 
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chimeric MCH1R 



Val 


Gly 


Arg 


Ala 


Val 


Gly 


Leu 


10 










15 




Glu 


Glu 


Asp 


Pro 


Leu 


Pro 


Asn 










30 






Gly 


Arg 


Arg 


Trp 


Arg 


Leu 


Pro 








45 








Ala 


Arg 


Leu 


Trp 


Glu 


Gin 


Ala 






60 










Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly 




75 










80 


Pro 


Asp 


Asn 


Leu 


Thr 


Ser 


Ala 


90 










95 




Ser 


Tyr 


He 


Asn 


He 


He 


Met 










110 






Leu 


Gly 


He 


He 


Gly 


Asn 


Ser 






125 






Ser 


Lys 


Leu 


His 


Trp 


Cys 


Asn 






140 










Leu 


Ser 


Val 


Val 


Asp 


Leu 


Leu 




155 










160 


His 


Gin 


Leu 


Met 


Gly 


Asn 


Gly 


170 








175 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp 










190 




He 


Leu 


Thr 


Ala 


Met 


Ala 


He 








205 








He 


Ser 


Ser 


Thr 


Lys 


Phe 


Arg 






220 






Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser 




235 










240 


Tyr 


Ala 


Arg 


Leu 


He 


Pro 


Phe 


250 










255 




Arg 


Leu 


Pro 


Asn 


Pro 


Asp 


Thr 










270 






Phe 


Phe 


Leu 


Ala 


Phe 


Ala 


Leu 








285 








Val 


Lys 


He 


Leu 


Gin 


Arg 


Met 






300 










Arg 


Ser 


He 


Arg 


Leu 


Arg 


Thr 




315 










320 


He 


Cys 


Leu 


Val 


Phe 


Phe 


Val 


330 










335 




Leu 


Thr 


Gin 


Leu 


Ser 


He 


Ser 










350 






Tyr 


Asn 


Ala 


Ala 


He 


Ser 


Leu 






365 








Phe 


Val 


Tyr 


He 


Val 


Leu 


Cys 






380 










Ser 


Val 


Lys 


Pro 


Ala 


Ala 


Gin 




395 










400 


Gin 


Thr 


Ala 


Asp 


Glu 


Glu 


Arg 



410 415 
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<400> 6 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

35 " 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 

50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 HO 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

115 ~ 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 7 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GPP derivative 
<400> 7 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230. 235 

<210> 8 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
<400> 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser "Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Lys Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 * 235 

<210> 9 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
<400> 9 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 " 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Arg 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 10 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
<400> 10 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 " 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Ala His Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val 


Leu 


Leu Pro Asp Asn His 


Tyr Leu Ser Thr Gin Ser 


Ala 


Leu 




195 










200 


A UJ 






Ser Lys Asp 


Pro Asn Glu Lys Arg 


Asp His Met Val Leu Leu 


Glu 


Phe 


210 










215 




220 






Val Thr Ala 


Ala Gly He Thr Leu 


Gly Met Asp Glu Leu Tyr 


Lys 




225 








230 






235 






<210> 11 


















<211> 604 


















<212> PRT 


















<213> Artificial Sequence 










<220> 




















<223> Mouse 


MCHlR-linker-EGFP 








<400> 11 


















Met Asp 


Leu 


Gin 


Ala 


Ser 


Leu 


Leu 


Ser Thr Gly Pro Asn Ala 


Ser 


Asn 


1 






5 








10 


15 




lie Ser 


Asp 


Gly 


Gin 


Asp 


Asn 


Phe 


Thr Leu Ala Gly Pro Pro 


Pro 


Arg 






20 










25 30 






Thr Arg 


Ser 


Val 


Ser 


Tyr 


He 


Asn 


He He Met Pro Ser Val 


Phe 


Gly 




35 










40 


45 






Thr lie 


Cys 


Leu 


Leu 


Gly 


He 


Val 


Gly Asn Ser Thr Val lie 


Phe 


Ala 


50 










55 




60 






Val Val 


Lvs 


Lys 


Ser 


Lys 


Leu 


His 


Trp Cys Ser Asn Val Pro 


Asp 


He 


65 








70 






75 




80 


Phe lie 


lie 


Asn 


Leu 


Ser 


Val 


Val 


Asp Leu Leu Phe Leu Leu 


Gly 


Met 








85 








90 


95 




Pro Phe 


Met 


He 


His 


Gin 


Leu 


Met 


Gly Asn Gly Val Trp His 


Phe 


Gly 






100 










105 110 






Glu Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala Met Asp Ala Asn Ser 


Gin 


Phe 




115 








120 


125 






Thr Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 


Met Ala He Asp Arg Tyr 


Leu 


Ala 


130 








135 




140 






Thr Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys Phe Arg Lys Pro Ser 


Met 


Ala 


145 








150 






155 




160 


Thr Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 


Ala Leu Ser Phe He Ser 


He 


Thr 








165 








170 


175 




Pro Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 


He Pro Phe Pro Gly Gly 


Ala 


Val 






180 










185 190 






Gly Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


Pro Asp Thr Asp Leu Tyr 


Trp 


Phe 




195 










200 


205 






Thr Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


Phe Ala Leu Pro Phe Val 


Val 


He 


210 










215 




220 






Thr Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


Gin Arg Met Thr Ser Ser 


Val 


Ala 


225 








230 






235 




240 


Pro Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 


Leu Arg Thr Lys Arg Val 


Thr 


Arg 








245 








250 


255 




Thr Ala 


He 


Ala 


He 


Cys 


Leu 


Val 


Phe Phe Val Cys Trp Ala 


Pro 


Tyr 






260 










265 270 






Tyr Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 


Ser He Ser Arg Pro Thr 


Leu 


Thr 


275 










280 


285 






Phe Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 


He Ser Leu Gly Tyr Ala 


Asn 


Ser 


290 








295 




300 






Cys Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


Val Leu Cys Glu Thr Phe 


Arg 


Lys 


305 








310 






315 




320 


Arg Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 


Ala Ala Gin Gly Gin Leu 


Arg 


Thr 






325 








330 


335 




Val Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 


Glu Glu Arg Thr Glu Ser 


Lys 


Gly 






340 










345 350 






Thr Val 


Asp 


Gly 


Thr 


Ala 


Gly 


Pro 


Gly Ser He Ala Thr Met 


Val 


Ser 




355 








360 


365 
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Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu 

370 375 380 

Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
385 390 395 400 

Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

405 410 415 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

435 440 445 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 

Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 

485 490 495 

Lvs Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 

500 505 510 

Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys 

515 520 525 

Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

530 535 540 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 
545 550 555 560 

Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 

565 570 575 

Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 

580 585 590 

Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 

<210> 12 
<211> 592 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mouse MCHlR/EGFP 
<400> 12 

Met Asp Leu Gin Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn 

1 5 10 15 

He Ser Asp Gly Gin Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg 

20 25 30 

Thr Arg Ser Val Ser Tyr He Asn He He Met Pro Ser Val Phe Gly 

35 40 45 

Thr He Cys Leu Leu Gly He Val Gly Asn Ser Thr Val He Phe Ala 

50 55 60 

Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp He 
65 70 75 80 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 95 

Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

100 105 HO 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

115 120 125 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

130 135 140 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 
145 150 155 160 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 
165 170 175 
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Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 








180 










Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 






195 










200 


Thr 


Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 




210 








215 




Thr 


Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


225 










230 






Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 










245 








Thr 


Ala 


lie 


Ala 


He 


Cys 


Leu 


Val 








260 








Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 






275 










280 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 




290 










295 




Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


He 


305 










310 




Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 










325 






Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 








340 










Thr 


Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 






355 










360 


Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 




370 










375 




Gly 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


385 










390 






He 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 










405 








Thr 


Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 








420 










Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 






435 










440 


Glu 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 




450 










455 




Glu 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


465 










470 






Gly 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 










485 








Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 








500 










Asn 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 






515 










520 


Ser 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 




530 








535 


Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


545 










550 




Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 










565 








Phe 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 



580 



<210> 13 
<211> 604 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> MCHlR-linker-EGFP 
<400> 13 



He Pro 


Phe Pro Gly Gly 


Ala 


Val 


185 


190 






Pro Asp 


Thr Asp Leu Tyr 


Trp 


Phe 




205 






Phe Ala 


Leu Pro Phe Val 


Val 


He 




220 






Gin Arg 


Met Thr Ser Ser 


Val 


Ala 




235 




240 


Leu Arg 


Thr Lys Arg Val 


Thr 


Arg 


250 




255 




Phe Phe 


Val Cys Trp Ala 


Pro 


Tyr 


265 


270 






Ser He 


Ser Arg Pro Thr 


Leu 


Thr 




285 






He Ser 


Leu Gly Tyr Ala 


Asn 


Ser 




300 






Val Leu 


Cys Glu Thr Phe 


Arg 


Lys 




315 




320 


Ala Ala 


Gin Gly Gin Leu 


Arg 


Thr 


330 




335 




Glu Glu 


Arg Thr Glu Ser 


Lys 


Gly 


345 


350 






Leu Phe 


Thr Gly Val Val 


Pro 


He 




365 






Asn Gly 


His Lys Phe Ser 


Val 


Ser 


380 






Tyr Gly 


Lys Leu Thr Leu 


Lys 


Phe 




395 




400 


Val Pro 


Trp Pro Thr Leu 


Val 


Thr 


410 




415 




Phe Ser 


Arg Tyr Pro Asp 


His 


Met 


425 


430 






Ala Met 


Pro Glu Gly Tyr 


Val 


Gin 




445 






Asp Gly 


Asn Tyr Lys Thr 


Arg 


Ala 




460 






Leu Val 


Asn Arg He Glu 


Leu 


Lys 




475 




480 


Asn He 


Leu Gly His Lys 


Leu 


Glu 


490 




495 




Tyr He 


Met Ala Asp Lys 


Gin 


Lys 


505 


510 






He Arg 


His Asn He Glu 


Asp 


Gly 




525 






Gin Gin 


Asn Thr Pro He 


Gly 


Asp 




540 


His Tyr 


Leu Ser Thr Gin 


Ser 


Ala 




555 




560 


Arg Asp 


His Met Val Leu 


Leu 


Glu 


570 




575 




Leu Gly 


Met Asp Glu Leu 


Tyr 


Lys 


585 


590 
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Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

15 10 15 

Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

20 25 30 

Thr Gly Ser He Ser Tyr He Asn He He Met Pro Ser Val Phe Gly 

35 40 45 

Thr He Cys Leu Leu Gly He He Gly Asn Ser Thr Val He Phe Ala 

50 55 60 

Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp He 
65 70 75 80 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 95 

Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

100 105 HO 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

115 120 125 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

130 135 140 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 
145 150 155 160 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 

165 170 175 

Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala Val 

180 185 190 

Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

195 200 205 

Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 

210 215 220 

Thr Ala Ala Tyr Val Lys He Leu Gin Arg Met Thr Ser Ser Val Ala 
225 230 235 240 

Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 255 

Thr Ala He Ala He Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 

260 265 270 

Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu Thr 

275 280 285 

Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 

290 295 300 

Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys Glu Thr Phe Arg Lys 
305 310 315 320 

Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin Gly Gin Leu Arg Thr 

325 330 335 

Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 

340 345 350 

Thr Val Asp Gly Thr Ala Gly Pro Gly Ser He Ala Thr Met Val Ser 

355 360 365 

Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu 

370 375 380 

Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
385 390 395 400 

Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

405 410 415 

Gly Lys Leu Pro Val Pro Tip Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

435 440 445 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 

Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 
485 490 495 
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Lys 


Glu 


Asp 


Gly Asn 


He Leu Gly His 








500 




505 


Ser 


His 


Asn 


Val 


Tyr 


He Met Ala Asp 






515 




520 


Val 


Asn 


Phe 


Lys 


He 


Arg His Asn He 




530 








535 


Ala 


Asp 


His 


Tyr Gin 


Gin Asn Thr Pro 


545 










550 


Leu 


Pro 


Asp 


Asn 


His 


Tyr Leu Ser Thr 










565 




Pro 


Asn 


Glu 


Lys 


Arg 


Asp His Met Val 








580 




585 


Ala 


Gly 


lie 


Thr 


Leu 


Gly Met Asp Glu 






595 






600 



<210> 14 
<211> 673 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> MCH1R- 1 inker -EGFP 



<400> 14 



Met 


Ser 


Val 


Gly 


Ala 


Met 


Lys Lys Gly 


1 








5 




Gly 


Gly 


Gly 


Ser 


Gly 


Cys 


Gin Ala Thr 








20 






25 


Cys 


Gly 


Ala 


Cys 


Ala 


Pro 


Gly Gin Gly 






35 








40 


Gin 


Pro 


Ala 


Trp 


Val 


Glu 


Gly Ser Ser 




50 










55 


Thr 


Gly 


Thr 


Gly 


Trp 


Met 


Asp Leu Glu 


65 










70 




Pro 


Asn 


Ala 


Ser 


Asn 


Thr 


Ser Asp Gly 










85 




Gly 


Ser 


Pro 


Pro 


Arg 


Thr 


Gly Ser He 








100 






105 


Pro 


Ser 


Val 


Phe 


Gly 


Thr 


He Cys Leu 






115 






120 


Thr 


Val 


He 


Phe 


Ala 


Val 


Val Lys Lys 




130 










135 


Asn 


Val 


Pro 


Asp 


He 


Phe 


He He Asn 


145 








150 




Phe 


Leu 


Leu 


Gly 


Met 


Pro 


Phe Met He 








165 






Val 


Trp 


His 


Phe 


Gly 


Glu 


Thr Met Cys 








180 






185 


Ala 


Asn 


Ser 


Gin 


Phe 


Thr 


Ser Thr Tyr 






195 








200 


Asp 


Arg 


Tyr 


Leu 


Ala 


Thr 


Val His Pro 




210 










215 


Lys 


Pro 


Ser 


Met 


Ala 


Thr 


Leu Val He 


225 










230 




Phe 


He 


Ser 


He 


Thr 


Pro 


Val Trp Leu 










245 






Pro 


Gly 


Gly 


Ala 


Val 


Gly 


Cys Gly He 








260 






265 


Asp 


Leu 


Tyr 


Trp 


Phe 


Thr 


Leu Tyr Gin 






275 








280 


Pro 


Phe 


Val 


Val 


He 


Thr 


Ala Ala Tyr 




290 










295 




PCTAJS01/08071 



Lys Leu 


Glu Tyr Asn 


Tyr 


Asn 




510 






Lys Gin 


Lys Asn Gly 


He 


Lys 




525 






Glu Asp 


Gly Ser Val 


Gin 


Leu 










He Gly 


Asp Gly Pro 


Val 


Leu 


555 






560 


Gin Ser 


Ala Leu Ser 


Lys 


Asp 


570 




575 


Leu Leu 


Glu Phe Val 


Thr 


Ala 




590 






Leu Tyr 


Lys 






Val Gly 


Arg Ala Val 


Gly 


Leu 


10 




15 




Glu Glu 


Asp Pro Leu 


Pro 


Asn 




30 






Gly Arg 


Arg Trp Arg 


Leu 


Pro 




45 






Ala Arg 


Leu Trp Glu 


Gin 


Ala 




60 






Ala Ser 


Leu Leu Pro 


Thr 


Gly 


75 






80 


Pro Asp 


Asn Leu Thr 


Ser 


Ala 


90 




95 




Ser Tyr 


He Asn He 


He 


Met 




110 






Leu Gly 


He He Gly 


Asn 


Ser 




125 






Ser Lys 


Leu His Trp 


Cys 


Asn 




140 






Leu Ser 


Val Val Asp 


Leu 


Leu 


155 




160 


His Gin 


Leu Met Gly 


Asn 


Gly 


170 




175 




Thr Leu 


He Thr Ala 


Met 


Asp 




190 






He Leu 


inr Axa wee 


Axa 


±ie 




205 






He Ser 


Ser Thr Lys 


Phe 


Arg 




220 






Cys Leu 


Leu Trp Ala 


Leu 


Ser 


235 






240 


Tyr Ala 


Arg Leu He 


Pro 


Phe 


250 




255 




Arg Leu 


Pro Asn Pro 


Asp 


Thr 




270 




Phe Phe 


Leu Ala Phe 


Ala 


Leu 




285 






Val Lys 


lie Leu Gin 

inn 


Arg 


Met 



300 
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Thr Ser Ser Val Ala Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr 
305 310 315 320 

Lys Arg Val Thr Arg Thr Ala He Ala He Cys Leu Val Phe Phe Val 

325 330 335 

Cys Trp Ala Pro Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser 

340 345 350 

Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu 

355 360 365 

Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys 

370 375 380 

Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin 
385 390 395 400 

Gly Gin Leu Arg Thr Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg 

405 410 415 

Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser He 

420 425 430 

Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 

435 440 445 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 

450 455 460 

Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
465 ~ 470 * 475 480 

Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

485 490 495 

Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

500 505 510 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 

515 " 520 525 

Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 

530 535 540 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
545 550 555 560 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 

565 570 575 

Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin 

580 585 590 

Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp 

595 600 605 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

610 615 620 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
625 630 635 640 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 

645 650 655 

Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 
660 665 670 

Lys 



<210> 15 
<211> 1269 
<212> DNA 
<213> Human 

<400> 15 

atgtcagtgg gagccatgaa gaagggagtg gggagggcag ttgggcttgg aggcggcagc 60 

ggctgccagg ctacggagga agaccccctt cccaactgcg gggcttgcgc tccgggacaa 120 

ggtggcaggc gctggaggct gccgcagcct gcgtgggtgg aggggagctc agctcggttg 180 

tgggagcagg cgaccggcac tggctggatg gacctggaag cctcgctgct gcccactggt 240 

cccaacgcca gcaacacctc tgatggcccc gataacctca cttcggcagg atcacctcct 300 

cgcacgggga gcatctccta catcaacatc atcatgcctt cggtgttcgg caccatctgc 360 

ctcctgggca tcatcgggaa ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg 420 



-14- 



WO 01/68706 



PCT/US01/08071 



cactggtgca acaacgtccc cgacatcttc atcatcaacc tctcggtagt agatctcctc 480 

tttctcctgg gcatgccctt catgatccac cagctcatgg gcaatggggt gtggcacttt 540 

ggggagacca tgtgcaccct catcacggcc atggatgcca atagtcagtt caccagcacc 600 

tacatcctga ccgccatggc cattgaccgc tacctggcca ctgtccaccc catctcttcc 660 

acgaagttcc ggaagccctc tgtggccacc ctggtgatct gcctcctgtg ggccctctcc 720 

ttcatcagca tcacccctgt gtggctgtat gccagactca tccccttccc aggaggtgca 780 

gtgggctgcg gcatacgcct gcccaaccca gacactgacc tctactggtt caccctgtac 840 

cagtttttcc tggcctttgc cctgcctttt gtggtcatca cagccgcata cgtgaggatc 900 

ctgcagcgca tgacgtcctc agtggccccc gcctcccagc gcagcatccg gctgcggaca 960 

aagagggtga cccgcacagc catcgccatc tgtctggtct tctttgtgtg ctgggcaccc 1020 

tactatgtgc tacagctgac ccagttgtcc atcagccgcc cgaccctcac ctttgtctac 1080 

ttatacaatg cggccatcag cttgggctat gccaacagct gcctcaaccc ctttgtgtac 1140 

atcgtgctct gtgagacgtt ccgcaaacgc ttggtcctgt cggtgaagcc tgcagcccag 1200 

gggcagcttc gcgctgtcag caacgctcag acggctgacg aggagaggac agaaagcaaa 1260 

ggcacctga 1269 

<210> 16 
<211> 1062 
<212> DMA 
<213> Human 

<400> 16 

atggacctgg aagcctcgct gctgcccact ggtcccaatg ccagcaacac ctctgatggc 60 

cccgataacc tcacttcggc aggatcacct cctcgcacgg ggagcatctc ctacatcaac 120 

atcatcatgc cttcggtgtt cggcaccatc tgcctcctgg gcatcatcgg gaactccacg 180 

gtcatcttcg cggtcgtgaa gaagtccaag ctgcactggt gcaacaacgt ccccgacatc 240 

ttcatcatca acctctcggt agtagatctc ctctttctcc tgggcatgcc cttcatgatc 300 

caccagctca tgggcaatgg ggtgtggcac tttggggaga ccatgtgcac cctcatcacg 360 

gccatggatg ccaatagtca gttcaccagc acctacatcc tgaccgccat ggccattgac 420 

cgctacctgg ccactgtcca ccccatctct tccacgaagt tccggaagcc ctctgtggcc 480 

accctggtga tctgcctcct gtgggccctc tccttcatca gcatcacccc tgtgtggctg 540 

tatgccagac tcatcccctt cccaggaggt gcagtgggct gcggcatacg cctgcccaac 600 

ccagacactg acctctactg gttcaccctg taccagtttt tcctggcctt tgccctgcct 660 

tttgtggtca tcacagccgc atacgtgagg atcctgcagc gcatgacgtc ctcagtggcc 720 

cccgcctccc agcgcagcat ccggctgcgg acaaagaggg tgacccgcac agccatcgcc 780 

atctgtctgg tcttctttgt gtgctgggca ccctactatg tgctacagct gacccagttg 840 

tccatcagcc gcccgaccct cacctttgtc tacttataca atgcggccat cagcttgggc 900 

tatgccaaca gctgcctcaa cccctttgtg tacatcgtgc tctgtgagac gttccgcaaa 960 

cgcttggtcc tgtcggtgaa gcctgcagcc caggggcagc ttcgcgctgt cagcaacgct 1020 

cagacggctg acgaggagag gacagaaagc aaaggcacct ga 1062 

<210> 17 
<211> 2080 
<212> DNA 
<213> Mouse 

<220> 

<221> misc_feature 
<222> (1) . . . (2080) 
<223> n = A,T,C or G 

<400> 17 

ggcggtagag gaagaccctt ttctggactg cggggctcaa gctccggaca aggcggtgga 60 

gggcgctgga ggctgccgca gcctgcgtgg gtggacgggc gctccactcc agggagcagg 120 

cgacctgcac cggctgcatg gatctgcaag cctcgttgct gtccactggc cccaatgcca 180 

gcaacatctc cgatggccag gataatttca cattggcggg gccacctcct cgcacaagga 240 

gtgtctccta catcaacatc atcatgcctt cagtgtttgg taccatctgt ctcctgggca 300 

ttgtgggaaa ctccacagtc atttttgccg tggtgaagaa atccaagctg cactggtgca 360 

gcaacgtccc tgacatcttc atcatcaacc tctctgtggt ggatctgctt ttcctgctgg 420 

gcatgccttt catgatccac cagctcatgg gtaatggtgt ctggcacttt ggggaaacca 480 

tgtgcaccct catcacagcc atggacgcca acagtcagtt caccagcacc tacatcctga 540 

ctgctatggc cattgaccgc tacttggcca ccgtccatcc catctcctcc accaagttcc 600 
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ggaagccctc 
tcactcctgt 
gcatccgcct 
tggccttcgc 
tgacgtcttc 
cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
aatccccccc 
gataaacccg 
ccacattcca 
cctcagactg 
tgactggttc 
ggcggaggtg 
ggctctactg 
gaaggcagtc 
tcctgtgcgg 
gagcccacag 
gagaatgaca 
acagtgacag 
gagcatccat 
ctttgtgctt 
cccaaaaaag 



catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 
cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
ggtcacctcc 
gggctaccct 
cggggcccac 
cagggatccc 
tcggggtctg 
cctagctggg 
tgagactggg 
tttctcccaa 
atttcacgtt 
gagtttaaaa 
aggcaacctg 
cgacattctc 
gaattctgca 
gagccttccn 
gggnaaaaaa 



ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
aagtcaggtc 
gggaggatgc 
aaattgctag 
cttgacagaa 
tatctgttgg 
ttcctttaaa 
caagccgagc 
gctggtggat 
ctctttaggg 
aaccaaaaga 
gggtggatgt 
ttcatagggc 
tgtgcagggg 
ctcccaaatg 
aaaaaaaaaa 



gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
accgcatcaa 
agaagctgga 
ggaggcttgc 
tagaagcgga 
ctcgcatata 
actaggcagg 
gttccctccc 
ctcctgaagc 
gatgcatgtt 
gctcagagtg 
ggatcttgaa 
acagctgtca 
tcactctaat 
ggaatgaaat 
aaaaaaaaaa 



ggctctctcg 
agggggtgct 
cactctgtat 
cgtgaaaata 
gcttcggaca 
ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 
accatgggga 
ggctgggggc 
agcctggttt 
gcaagaagga 
tctttctctc 
gctaggatct 
atctctcatt 
acgctgcctg 
gacactgggg 
tcgagagaga 
actaataaaa 
gtctatggct 
acctgatatg 
aaaggcaaat 



ttcattagca 
gtgggctgtg 
cagtttttcc 
ctacagcgca 
aagagggtga 
tactacgtgc 
ctgtacaatg 
atagtactct 
gggcagcttc 
ggcacctgac 
gagatactga 
ttgtagcaaa 

gggggggaag 

aagggtggtt 
aagggaagaa 
gagcagctag 
ggtgttgata 
ggctccagca 
tgtgggctct 
cccaatcacc 
aggggttttc 
gatccagagc 
ttggcatcat 
tcccnccccc 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2080 



<210> 18 
<211> 3357 
<212> DNA 
<213> Mouse 



<220> 

<221> misc_ feature 
<222> (1) . . . (3357) 
<223> n = A,T,C or G 



<400> 18 

ggcggtagag 

gggcgctgga 

cgacctgcac 

gcaacatctc 

cctcctccgg 

gaaactttat 

gaggagagga 

aggcacattt 

cttgctgatg 

aagcaagcaa 

ctttgattca 

gagtggtcct 

agaggattaa 

aaattctagc 

attcccaagc 

ctcaggagac 

agcctgaaac 

atctgaaacc 

cacacacaca 

aagccttttt 

atctgggagt 

cctctgagta 

gggtaaggta 

gacgtagtga 

caggttgtca 



gaagaccctt 
ggctgccgca 
cggctgcatg 
cgatggccag 
gatgggtgtg 
cttacagttc 
ggggacacca 
agaaacaaca 
agtcccaaat 
gcaagacagc 
ccgctatcct 
tgctgtcttc 
cttttctggt 
tctgaaggag 
ttctggggtg 
atggtccaag 
acaccccaca 
tttggccatc 
cacacacaca 
gacaaggttt 
cttgactcct 
acaggtggat 
cagaactaac 
ccccaagctg 
gcttctagat 



ttctggactg 
gcctgcgtgg 
gatctgcaag 
gataatttca 
gaaaatggga 
ttggtgataa 
gctaggacag 
agggttggtg 
atgctttgca 
aagacagcgg 
gggctcaatc 
tggcaaaggc 
gaattaagct 
aagggaatga 
taacacagct 
gaaatccctg 
cccatggtga 
tctctctgaa 
cacacacaca 
tctggtggat 
ggccctctga 
ctagaaaatc 
aatagcccga 
caagggaaag 
cggaaggcgg 



cggggctcaa 
gtggacgggc 
cctcgttgct 
cattggcggg 
aggtttcacc 
gatctgcagt 
aaggggcagg 
acaagacgtg 

gggggggggg 
gcgggtagta 
ctctggcctc 
tgctgggagc 
tcttgacatt 
aggggaaaga 
ccagtcccta 
acagaaaacc 
cagtcacttc 
aagatgaggc 
cacacacaca 
cccggggata 
gtaaatggat 
ctataggagt 
gaaggggaaa 
catgaggggc 
gtcacacttg 



gctccggaca 
gctccactcc 
gtccactggc 
tgagtcgagt 
tcccaagcca 
cggctttgcc 
gaggaataga 
aggcaggctt 

ggggggaatc 
tgtgggagcc 
gcactgggga 
aaaagacttc 
tgcagaacgt 
gggaaggttg 
ccctattggg 
gggagagggc 
tcacatatgc 
tgcaaataca 
cacacacaca 
tgaagttgtt 
gaagcgaaga 
caccagggca 
cagcaggaga 
cagcaggaag 
ctctttctat 



aggcggtgga 
agggagcagg 
cccaatgcca 
tggagtcctc 
aactgcctgg 
tgaagaggaa 
gatggggcag 
gaggggaaag 
aaggctggag 
agcagaagcg 
aatggggtct 
acagggcgtg 
caatgcctta 
gtgtggagaa 
aaagcccaga 
agggctgtgg 
ctaggaacct 
cacacacaca 
aatgtccttc 
ctcagcagat 
agaatggggt 
cggtggagga 
tgattccaga 
gccgacatgg 
cctcagggcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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acctcctcgc acaaggagtg tctcctacat caacatcatc atgccttcag tgtttggtac 1560 

catctgtctc ctgggcattg tgggaaactc cacagtcatt tttgccgtgg tgaagaaatc 1620 

f*f gC ^* C ^Scagca acgtccctga catcttcatc atcaacctct ctgtggtgga 1680 

tctgcttttc ctgctgggca tgcctttcat gatccaccag ctcatgggta atggtgtctg 1740 

gcactttggg gaaaccatgt gcaccctcat cacagccatg gacgccaaca gtcagttcac 1800 

cagcacctrac atcctgactg ctatggccat tgaccgctac ttggccaccg tccatcccat 1860 

ctcctccacc aagttccgga agccctccat ggccaccctg gtgatctgcc tcctgtgggc 1920 

tctctcgttc attagcatca ctcctgtgtg gctctatgcc aggcttatcc ccttcccagg 1980 

gggtgctgtg ggctgtggca tccgcctacc aaacccagat actgatcttt actggttcac 2040 

tctgtatcag tttttcctgg ccttcgccct tccgtttgtg gtcatcactg ctgcgtacgt 2100 

gaaaatacta cagcgcatga cgtcttcggt ggccccagcc tctcaacgca gcatccggct 2160 

tcggacaaag agggtgaccc gcacagccat tgccatctgt ctggtcttct ttgtgtgctg 2220 

ggcgccctac tacgtgctgc agctgaccca gttgtccatc agccgcccga ccctcacatt 2280 

cgtctacctg tacaatgcgg ccatcagctt gggctatgcc aacagctgcc tcaatccctt 2340 

tgtgtacata gtactctgtg agacctttcg aaaacgcttg gtgctgtcgg tgaagcccgc 2400 

ggcccagggg cagcttcgca cggtcagcaa tgctcagaca gctgacgagg agaggacaga 2460 

aagcaaaggc acctgacaat cccccccggt cacctccaag tcaggtcacc gcatcaaacc 2520 

atggggagag atactgagat aaacccgggg ctaccctggg aggatgcaga agctggaggc 2580 

tgggggcttg tagcaaacca cattccacgg ggcccacaaa ttgctaggga ggcttgcagc 2640 

ctggtttggg ggggaagcct cagactgcag ggatcccctt gacagaatag aagcggagca 2700 

agaaggaaag ggtggtttga ctggttctcg gggtctgtat ctgttggctc gcatatatct 2760 

ttctctcaag ggaagaaggc ggaggtgcct agctgggttc ctttaaaact aggcagggct 2820 

aggatctgag cagctagggc tctactgtga gactgggcaa gccgagcgtt ccctcccatc 2880 

tctcattggt gttgatagaa ggcagtcttt ctcccaagct ggtggatctc ctgaagcacg 2940 

ctgcctgggc tccagcatcc tgtgcggatt tcacgttctc tttaggggat gcatgttgac 3000 

actggggtgt gggctctgag cccacaggag tttaaaaaac caaaagagct cagagtgtcg 3060 

agagagaccc aatcaccgag aatgacaagg caacctgggg tggatgtgga tcttgaaact 3120 

aataaaaagg ggttttcaca gtgacagcga cattctcttc atagggcaca gctgtcagtc 3180 

tatggctgat ccagagcgag catccatgaa ttctgcatgt gcaggggtca ctctaatacc 3240 

tgatatgttg gcatcatctt tgtgcttgag ccttccnetc ccaaatggga atgaaataaa 33 00 

ggcaaattcc cncccccccc aaaaaagggg naaaaaaaaa aaaaaaaaaa aaaaaaa 3357 

<210> 19 
<211> 1062 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Htiman short form/mouse species chimeric MCH1R 
<400> 19 

atggacctgg aagcctcgct gctgcccact ggtcccaatg ccagcaacac ctctgatggc 60 

-?SS?Sf CC t ?f° tt f g ?f a ^ afcc acct cctcgcacgg ggagcatctc ctacatcaac 120 

atcatcatgc cttcggtgtt cggcaccatc tgcctcctgg gcatcatcgg gaactccacg 180 

gtcatcttcg cggtcgtgaa gaagtccaag ctgcactggt gcaacaacgt ccccgacatc 240 

ttcatcatca acctctcggt agtagatctc ctctttctcc tgggcatgcc cttcatgatc 300 

caccagctca tgggcaatgg ggtgtggcac tttggggaga ccatgtgcac cctcatcacg 360 

gccatggatg ccaatagtca gttcaccagc acctacatcc tgaccgccat ggccattgac 420 

cgctacctgg ccactgtcca ccccatctct tccacgaagt tccggaagcc ctccatggcc 480 

accctggtga tctgcctcct gtgggctctc tcgttcatta gcatcactcc tgtgtggctc 540 

JS-S-S 3 ? 0 t ^ a Hf^ cctt cccagggggt gctgtgggct gtggcatccg cctaccaaac 600 

SfSSS? f tct t ta ftg gttcactctg tatcagtttt tcctggcctt cgcccttccg 660 

tttgtggtca tcactgctgc gtacgtgaaa atactacagc gcatgacgtc ttcggtggcc 720 

aZrfnZlt™ c ^ cttc ^ acaaagaggg tgacccgcac agccattgcc 780 

atctgtctgg tcttctttgt gtgctgggcg ccctactacg tgctgcagct gacccagttg 840 

tccatcagcc gcccgaccct cacattcgtc tacctgtaca atgcggccat cagcttgggc 900 

? c ^ gcc ^ caa tccctttgtg tacatagtac tctgtgagac ctttcgaaaa 960 

S^KlE: t^tcggtgaa gcccgcggcc caggggcagc ttcgcacggt cagcaatgct 1020 

cagacagctg acgaggagag gacagaaagc aaaggcacct ga 10 62 



<210> 20 
<211> 1269 
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<212> DNA 

<213> Artif icial Segufince 
<220> 

<223> Human long form/mouse 

<400> 20 chimeric MCH1R 

2K5£ Sfc a l~1 S55SS ssS^ « 

•Wgcttc £Sf«|" tgK££! £" CMto = ""tftjtal ffJS 

«210> 21 «saa«ocaa« 1260 



cacacacgaa taa 

IS SE S= seas a«p 
SsF Si s= ^ SF -Sis 
§s= Sg S"! Sil S» 

<210> 22 at ttgacttg 960 

<211> 765 966 
<212> DNA 

<2"> Artifice Seguence 
<220> 

<223> GPP derivative 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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HP SS 5=s ssss — . 

£«« sssss "«■»«« «ss a ? M 5° a, =««ti 

y35ES sS sssS SS fei 

pSB g£ S p£ JSSl 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 

<2liZ % Q "~ " ycg Wcflc - — ^ccg cc gj 

<212> 7 65 

<213> Seguence 

<220> 

< 2 23> GPP derivative 




<220> 

<223> Gpp derivative 
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<213> Artificial Sequence 
<220> 

<223> GFP derivative 



<400> 25 

atggtgagca 

ggcgacgtaa 

ggcaagctga 

ctcgtgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgca 

aagctggagt 

ggcatcaagg 

gaccactacc 

tacctgagca 

ctgctggagt 



agggcgagga 
acggccacag 
ccctgaagtt 
ccctgacctg 
tcttcaagtc 
acggcaacta 
tcgagctgaa 
acaactacat 
cccacttcaa 
agcagaacac 
cccagtccgc 
tcgtgaccgc 



gctgttcacc 
gttcagcgtg 
catctgcacc 
gggcgtgcag 
cgccatgccc 
caagacccgc 
gggcatcgac 
cagccacaac 
gatccgccac 
ccccatcggc 
cctgagcaaa 
cgccgggatc 



ggggtggtgc 
tccggcgagg 
accggcaagc 
tgcttcagcc 
gaaggctacg 
gccgaggtga 
ttcaaggagg 
gtctatatca 
aacatcgagg 
gacggccccg 
gaccccaacg 
actctcggca 



ccatcctggt 
gcgagggcga 
tgcccgtgcc 
gctaccccga 
tccaggagcg 
agttcgaggg 
acggcaacat 
ccgccgacaa 
acggcagcgt 
tgctgctgcc 
agaagcgcga 
tggacgagct 



cgagctggac 
tgccacctac 
ctggcccacc 
ccacatgaag 
taccatcttc 
cgacaccctg 
cctggggcac 
gcagaagaac 
gcagctcgcc 
cgacaaccac 
tcacatggtc 
gtacaagtaa 



<210> 26 
<211> 3092 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R- 1 inker-EGFP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



<400> 26 

atggatctgc 

caggataatt 

gtggaaaatg 

ttcttggtga 

ccagctagga 

acaagggttg 

aatatgcttt 

agcaagacag 

cctgggctca 

ttctggcaaa 

ggtgaattaa 

gagaagggaa 

gtgtaacaca 

aaggaaatcc 

acacccatgg 

atctctctct 

acacacacac 

ttttctggtg 

cctggccctc 

gatctagaaa 

aacaatagcc 

ctgcaaggga 

gatcggaagg 

gtgtctccta 

ttgtgggaaa 

gcaacgtccc 

gcatgccttt 

tgtgcaccct 

ctgctatggc 

ggaagccctc 

tcactcctgt 

gcatccgcct 

tggccttcgc 

tgacgtcttc 



aagcctcgtt 

tcacattggc 

ggaaggtttc 

taagatctgc 

cagaaggggc 

gtgacaagac 

gcaggggggg 

cgggcgggta 

atcctctggc 

ggctgctggg 

gcttcttgac 

tgaaggggaa 

gctccagtcc 

ctgacagaaa 

tgacagtcac 

gaaaagatga 

acacacacac 

gatcccgggg 

tgagtaaatg 

atcctatagg 

cgagaagggg 

aagcatgagg 

cgggtcacac 

catcaacatc 

ctccacagtc 

tgacatcttc 

catgatccac 

catcacagcc 

cattgaccgc 

catggccacc 

gtggctctat 

accaaaccca 

ccttccgttt 

ggtggcccca 



gctgtccact 
gggtgagtcg 
acctcccaag 
agtcggcttt 
agggaggaat 
gtgaggcagg 
ggggggggga 

gtatgtggga 
ctcgcactgg 
agcaaaagac 
atttgcagaa 
agagggaagg 
ctaccctatt 
accgggagag 
ttctcacata 
ggctgcaaat 
acacacacac 
atatgaagtt 
gatgaagcga 
agtcaccagg 
aaacagcagg 
ggccagcagg 
ttgctctttc 
atcatgcctt 
atttttgccg 
atcatcaacc 
cagctcatgg 
atggacgcca 
tacttggcca 
ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 



ggccccaatg 

agttggagtc 

ccaaactgcc 

gcctgaagag 

agagatgggg 

cttgagggga 

atcaaggctg 

gccagcagaa 

ggaaatgggg 

ttcacagggc 

cgtcaatgcc 

ttggtgtgga 

gggaaagccc 

ggcagggctg 

tgcctaggaa 

acacacacac 

acaaatgtcc 

gttctcagca 

agaagaatgg 

gcacggtgga 

agatgattcc 

aaggccgaca 

tatcctcagg 

cagtgtttgg 

tggtgaagaa 

tgtctgtggt 

gtaatggtgt 

acagtcagtt 

ccgtccatcc 

gcctcctgtg 

tccccttccc 

tttactggtt 

ctgctgcgta 

gcagcatccg 



ccagcaacat 

ctccctcctc 

tgggaaactt 

gaagaggaga 

cagaggcaca 

aagcttgctg 

gagaagcaag 

gcgctttgat 

tctgagtggt 

gtgagaggat 

ttaaaattct 

gaaattccca 

agactcagga 

tggagcctga 

cctatctgaa 

acacacacac 

ttcaagcctt 

gatatctggg 

ggtcctctga 

ggagggtaag 

agagacgtag 

tggcaggttg 

gccacctcct 

taccatctgt 

atccaagctg 

ggatctgctt 

ctggcacttt 

caccagcacc 

catctcctcc 

ggctctctcg 

agggggtgct 

cactctgtat 

cgtgaaaata 

gcttcggaca 



ctccgatggc 

cgggatgggt 

tatcttacag 

ggaggggaca 

tttagaaaca 

atgagtccca 

caagcaagac 

tcaccgctat 

ccttgctgtc 

taacttttct 

agctctgaag 

agcttctggg 

gacatggtcc 

aacacacccc 

acctttggcc 

acacacacac 

tttgacaagg 

agtcttgact 

gtaacaggtg 

gtacagaact 

tgaccccaag 

tcagcttcta 

cgcacaagga 

ctcctgggca 

cactggtgca 

ttcctgctgg 

ggggaaacca 

tacatcctga 

accaagttcc 

ttcattagca 

gtgggctgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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cccgcacagc cattgccatc tgtctggtct tctttgtgtg ctgggcgccc tactacgtgc 2100 

tgcagctgac ccagttgtcc atcagccgcc cgaccctcac attcgtctac ctgtacaatg .2160 

cggccatcag cttgggctat gccaacagct gcctcaatcc ctttgtgtac atagtactct 2220 

gtgagacctt tcgaaaacgc ttggtgctgt cggtgaagcc cgcggcccag gggcagcttc 2280 

gcacggtcag caatgctcag acagctgacg aggagaggac agaaagcaaa ggcaccgtcg 2340 

acggtaccgc gggcccggga tccatcgcca ccatggtgag caagggcgag gagctgttca 2400 

ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg 2460 

tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca 2520 

ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc 2580 

agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc 2640 

ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc 2700 

gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg 2760 

acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca 2820 

acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc 2880 

acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg 2940 

gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca 3000 

aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga 3060 

tcactctcgg catggacgag ctgtacaagt aa 3092 

<210> 27 
<211> 3056 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R/EGFP direct fusion 
<400> 27 

atggatctgc aagcctcgtt gctgtccact ggccccaatg ccagcaacat ctccgatggc 60 

caggataatt tcacattggc gggtgagtcg agttggagtc ctccctcctc cgggatgggt 120 

gtggaaaatg ggaaggtttc acctcccaag ccaaactgcc tgggaaactt tatcttacag 180 

ttcttggtga taagatctgc agtcggcttt gcctgaagag gaagaggaga ggaggggaca 240 

ccagctagga cagaaggggc agggaggaat agagatgggg cagaggcaca tttagaaaca 300 

acaagggttg gtgacaagac gtgaggcagg cttgagggga aagcttgctg atgagtccca 360 

aatatgcttt gcaggggggg ggggggggga atcaaggctg gagaagcaag caagcaagac 420 

agcaagacag cgggcgggta gtatgtggga gccagcagaa gcgctttgat tcaccgctat 480 

cctgggctca atcctctggc ctcgcactgg ggaaatgggg tctgagtggt ccttgctgtc 540 

ttctggcaaa ggctgctggg agcaaaagac ttcacagggc gtgagaggat taacttttct 600 

ggtgaattaa gcttcttgac atttgcagaa cgtcaatgcc ttaaaattct agctctgaag 660 

gagaagggaa tgaaggggaa agagggaagg ttggtgtgga gaaattccca agcttctggg 720 

gtgtaacaca gctccagtcc ctaccctatt gggaaagccc agactcagga gacatggtcc 780 

aaggaaatcc ctgacagaaa accgggagag ggcagggctg tggagcctga aacacacccc 840 

acacccatgg tgacagtcac ttctcacata tgcctaggaa cctatctgaa acctttggcc 900 

atctctctct gaaaagatga ggctgcaaat acacacacac acacacacac acacacacac 960 

acacacacac acacacacac acacacacac acaaatgtcc ttcaagcctt tttgacaagg 1020 

ttttctggtg gatcccgggg atatgaagtt gttctcagca gatatctggg agtcttgact 1080 

cctggccctc tgagtaaatg gatgaagcga agaagaatgg ggtcctctga gtaacaggtg 1140 

gatctagaaa atcctatagg agtcaccagg gcacggtgga ggagggtaag gtacagaact 1200 

aacaatagcc cgagaagggg aaacagcagg agatgattcc agagacgtag tgaccccaag 1260 

ctgcaaggga aagcatgagg ggccagcagg aaggccgaca tggcaggttg tcagcttcta 1320 

gatcggaagg cgggtcacac ttgctctttc tatcctcagg gccacctcct cgcacaagga 1380 

gtgtctccta catcaacatc atcatgcctt cagtgtttgg taccatctgt ctcctgggca 1440 

ttgtgggaaa ctccacagtc atttttgccg tggtgaagaa atccaagctg cactggtgca 1500 

gcaacgtccc tgacatcttc atcatcaacc tctctgtggt ggatctgctt ttcctgctgg 1560 

gcatgccttt catgatccac cagctcatgg gtaatggtgt ctggcacttt ggggaaacca 1620 

tgtgcaccct catcacagcc atggacgcca acagtcagtt caccagcacc tacatcctga 1680 

ctgctatggc cattgaccgc tacttggcca ccgtccatcc catctcctcc accaagttcc 1740 

ggaagccctc catggccacc ctggtgatct gcctcctgtg ggctctctcg ttcattagca 1800 

tcactcctgt gtggctctat gccaggctta tccccttccc agggggtgct gtgggctgtg 1860 

gcatccgcct accaaaccca gatactgatc tttactggtt cactctgtat cagtttttcc 1920 

tggccttcgc ccttccgttt gtggtcatca ctgctgcgta cgtgaaaata ctacagcgca 1980 

tgacgtcttc ggtggcccca gcctctcaac gcagcatccg gcttcggaca aagagggtga 2040 
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cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
tgagcaaggg 
acgtaaacgg 
agctgaccct 
tgaccaccct 
acgacttctt 
aggacgacgg 
accgcatcga 
tggagtacaa 
tcaaggtgaa 
actaccagca 
tgagcaccca 
tggagttcgt 



cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
cgaggagctg 
ccacaagttc 
gaagttcatc 
gacctacggc 
caagtccgcc 
caactacaag 
gctgaagggc 
ctacaacagc 
cttcaagatc 
gaacaccccc 
gtccgccctg 
gaccgccgcc 



tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
ttcaccgggg 
agcgtgtccg 
tgcaccaccg 
gtgcagtgct 
atgcccgaag 
acccgcgccg 
atcgacttca 
cacaacgtct 
cgccacaaca 
atcggcgacg 
agcaaagacc 
gggatcactc 



tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
tggtgcccat 
gcgagggcga 
gcaagctgcc 
tcagccgcta 
gctacgtcca 
aggtgaagtt 
aggaggacgg 
atatcatggc 
tcgaggacgg 
gccccgtgct 
ccaacgagaa 
tcggcatgga 



ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 
cctggtcgag 
gggcgatgcc 
cgtgccctgg 
ccccgaccac 
ggagcgcacc 
cgagggcgac 
caacatcctg 
cgacaagcag 
cagcgtgcag 
gctgcccgac 
gcgcgatcac 
cgagctgtac 



tactacgtgc 
ctgtacaatg 
atagtactct 
gggcagcttc 
ggcaccatgg 
ctggacggcg 
acctacggca 
cccaccctcg 
atgaagcagc 
atcttcttca 
accctggtga 
gggcacaagc 
aagaacggca 
ctcgccgacc 
aaccactacc 
atggtcctgc 
aagtaa 



<210> 28 
<211> 1815 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric 
MCH1R-1 inker -EGFP 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3056 



<400> 28 

atggacctgg 

cccgataacc 

atcatcatgc 

gtcatcttcg 

ttcatcatca 

caccagctca 

gccatggatg 

cgctacctgg 

accctggtga 

tatgccaggc 

ccagatactg 

tttgtggtca 

ccagcctctc 

atctgtctgg 

tccatcagcc 

tatgccaaca 

cgcttggtgc 

cagacagctg 

ggatccatcg 

ctggtcgagc 

ggcgatgcca 

gtgccctggc 

cccgaccaca 

gagcgcacca 

gagggcgaca 

aacatcctgg 

gacaagcaga 

agcgtgcagc 

ctgcccgaca 

cgcgatcaca 

gagctgtaca 

<210> 29 
<211> 2022 



aagcctcgct 
tcacttcggc 
cttcggtgtt 
cggtcgtgaa 
acctctcggt 
tgggcaatgg 
ccaatagtca 
ccactgtcca 
tctgcctcct 
ttatcccctt 
atctttactg 
tcactgctgc 
aacgcagcat 
tcttctttgt 
gcccgaccct 
gctgcctcaa 
tgtcggtgaa 
acgaggagag 
ccaccatggt 
tggacggcga 
cctacggcaa 
ccaccctcgt 
tgaagcagca 
tcttcttcaa 
ccctggtgaa 
ggcacaagct 
agaacggcat 
tcgccgacca 
accactacct 
tggtcctgct 
agtaa 



gctgcccact 
aggatcacct 
cggcaccatc 
gaagtccaag 
agtagatctc 
ggtgtggcac 
gttcaccagc 
ccccatctct 
gtgggctctc 
cccagggggt 
gttcactctg 
gtacgtgaaa 
ccggcttcgg 
gtgctgggcg 
cacattcgtc 
tccctttgtg 
gcccgcggcc 
gacagaaagc 
gagcaagggc 
cgtaaacggc 
gctgaccctg 
gaccaccctg 
cgacttcttc 
ggacgacggc 
ccgcatcgag 
ggagtacaac 
caaggtgaac 
ctaccagcag 
gagcacccag 
ggagttcgtg 



ggtcccaatg 
cctcgcacgg 
tgcctcctgg 
ctgcactggt 
ctctttctcc 
tttggggaga 
acctacatcc 
tccacgaagt 
tcgttcatta 
gctgtgggct 
tatcagtttt 
atactacagc 
acaaagaggg 
ccctactacg 
tacctgtaca 
tacatagtac 
caggggcagc 
aaaggcaccg 
gaggagctgt 
cacaagttca 
aagttcatct 
acctacggcg 
aagtccgcca 
aactacaaga 
ctgaagggca 
tacaacagcc 
ttcaagatcc 
aacaccccca 
tccgccctga 
accgccgccg 



ccagcaacac 
ggagcatctc 
gcatcatcgg 
gcaacaacgt 
tgggcatgcc 
ccatgtgcac 
tgaccgccat 
tccggaagcc 
gcatcactcc 
gtggcatccg 
tcctggcctt 
gcatgacgtc 
tgacccgcac 
tgctgcagct 
atgcggccat 
tctgtgagac 
ttcgcacggt 
tcgacggtac 
tcaccggggt 
gcgtgtccgg 
gcaccaccgg 
tgcagtgctt 
tgcccgaagg 
cccgcgccga 
tcgacttcaa 
acaacgtcta 
gccacaacat 
tcggcgacgg 
gcaaagaccc 
ggatcactct 



ctctgatggc 
ctacatcaac 
gaactccacg 
ccccgacatc 
cttcatgatc 
cctcatcacg 
ggccattgac 
ctccatggcc 
tgtgtggctc 
cctaccaaac 
cgcccttccg 
ttcggtggcc 
agccattgcc 
gacccagttg 
cagcttgggc 
ctttcgaaaa 
cagcaatgct 
cgcgggcccg 
ggtgcccatc 
cgagggcgag 
caagctgccc 
cagccgctac 
ctacgtccag 
ggtgaagttc 
ggaggacggc 
tatcatggcc 
cgaggacggc 
ccccgtgctg 
caacgagaag 
cggcatggac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1815 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human long form/mouse species chimeric 
MCH1R- 1 inker- EGFP 

<400> 29 

atgtcagtgg gagccatgaa gaagggagtg gggagggcag ttgggcttgg aggcggcagc 60 

ggctgccagg ctacggagga agaccccctt cccaactgcg gggcttgcgc tccgggacaa 120 

ggtggcaggc gctggaggct gccgcagcct gcgtgggtgg aggggagctc agctcggttg 180 

tgggagcagg cgaccggcac tggctggatg gacctggaag cctcgctgct gcccactggt 240 

cccaacgcca gcaacacctc tgatggcccc gataacctca cttcggcagg atcacctcct 300 

cgcacgggga gcatctccta catcaacatc atcatgcctt cggtgttcgg caccatctgc 360 

ctcctgggca tcatcgggaa ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg 420 

cactggtgca acaacgtccc cgacatcttc atcatcaacc tctcggtagt agate tec tc 480 

tttctcctgg gcatgccctt catgatccac cagctcatgg gcaatggggt gtggcacttt 540 

ggggagacca tgtgcaccct catcacggcc atggatgcca atagtcagtt caccagcacc 600 

tacatcctga ccgccatggc cattgaccgc tacctggcca ctgtccaccc catctcttcc 660 

acgaagttcc ggaagccctc catggccacc ctggtgatct gcctcctgtg ggctctctcg 720 

ttcattagca tcactcctgt gtggctctat gecaggctta tccccttccc agggggtget 780 

gtgggctgtg gcatccgcct accaaaccca gatactgatc tttactggtt cactctgtat 840 

cagtttttcc tggccttcgc ccttccgttt gtggtcatca ctgctgcgta cgtgaaaata 900 

ctacagcgca tgaegtctte ggtggcccca gcctctcaac gcagcatccg getteggaca 960 

aagagggtga cccgcacagc cattgccatc tgtctggtct tctttgtgtg ctgggcgccc 1020 

tactacgtgc tgeagctgae ccagttgtcc atcagccgcc cgaccctcac attegtctae 108,0 

ctgtacaatg cggccatcag cttgggctat gccaacagct gcctcaatcc ctttgtgtac 1140 

atagtactct gtgagacctt tegaaaaege ttggtgctgt eggtgaagee cgcggcccag 1200 

gggcagcttc geaeggtcag caatgetcag acagctgacg aggagaggac agaaagcaaa 1260 

ggcaccgtcg acggtaccgc gggcccggga tccatcgcca ccatggtgag caagggegag 1320 

gagctgttca ccggggtggt gcccatcctg gtcgagctgg aeggegaegt aaacggccac 1380 

aagttcagcg tgtceggega gggegaggge gatgccacct aeggcaaget gaccctgaag 1440 

ttcatctgea ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc 1500 

tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag 1560 

tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac 1620 

tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg 1680 

aagggcatcg acttcaagga ggaeggcaac atcctggggc acaagctgga gtacaactac 1740 

aacagccaca aegtctatat catggccgac aagcagaaga aeggcatcaa ggtgaacttc 1800 

aagatccgcc acaacatcga ggaeggcage gtgeagcteg ccgaccacta ccagcagaac 1860 

acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc 1920 

gccctgagca aagaccccaa egagaagege gatcacatgg tcctgctgga gttcgtgacc 1980 

gccgccggga tcactctcgg catggacgag ctgtacaagt aa 2022 

<210> 30 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Linker 
<400> 30 

tegaeggtae cgcgggcccg ggatccatcg ccacc 35 

<210> 31 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker 
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<400> 31 

Val Asp Gly Thr Ala Gly Pro Gly Ser He Ala Thr 
! 5 10 

<210> 32 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 32 

gcgaattcac catggatctg caagcctcg 

<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 33 

gcgtcgacgg tgcctttgct ttctgtcc 

<210> 34 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 34 

ccttgctcac catggtgcct ttgctttctg tec 

<210> 35 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> ' 

<223> PCR primer 
<400> 35 

cagaaagcaa aggcaccatg gtgagcaagg gegaggage 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 36 

ggeggatect etagagtege ggee 

<210> 37 
<211> 38 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 37 

gctctagagt cgcggccgct tacttgtaca gctcgtcc 38 
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